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The present invention relates to novel proteins and processes for obtaining them. These proteins are 
capable of inducing cartilage and bone formation. 

Background 

5 

Bone is a highly specialized tissue characterized by an extensive matrix structure formed of fibrous 
bundles of the protein collagen, and proteoglycans, noncollagenous proteins, lipids and acidic proteins. The 
processes of bone formation and renewal/repair of bone tissue, which occur continuously throughout life, 
are performed by specialized cells. Normal embryonic Jong bone development is preceded by formation of 
70 a cartilage model. Bone growth is presumably mediated by "osteoblasts" (bone-forming cells), while 

remodeling of bone is apparently accomplished by the joint activities of hone-resorbing cells, called 

"osteoclasts” and osteoblasts. A variety of osteogenic, cartilage-inducing and bone inducing factors have 
been described. See, e.g. European patent applications 148,155 and 169,016 for discussions thereof. 

75 Brief Description of the Invention 

The present invention provides novel proteins in purified form. Specifically, four of the novel proteins 
are designated BMP-1, BMP-2 Class I (or BMP-2), BMP-3, and BMP-2 Class II (or BMP-4) wherein BMP is 
bone morphogenic protein. These proteins are characterized by peptide sequences the same as or 

20 substantially homologous to amino acid sequences illustrated in Tables II through VIII below. They are 

capable of inducing bone formation at a predetermined site. These bone inductive factors are further 
characterized by biochemical and biological characteristics including activity at a concentration of 10 to 
lOOOng/gram of bone in an in vivo rat bone formation assay described below. Proteins of this invention may 
b© encoded by the DNA sequences depicted in the Tables or by sequences capable of hybridizing thereto 
25 and coding for polypeptides with bone growth factor biological properties or other variously modified 
sequences demonstrating such properties. 

One of the proteins of the invention is designated BMP-1. A portion of the humanBMP-1 or hBMP-1 is 
characterized by the same or substantially the same peptide sequence as that of amino acid #1 through 
amino acid #37 of Table V, below which represents a genomic hBMP-1 fragment or amino acid #1 through 
30 amino acid #730 of Table VI which represents the hBMP-1 cDNA. hBMP-1 or a related bone inductive 
factor may be further characterized by at least a portion of these sequences. These peptide sequences are 
encoded by the same or substantially the same DNA sequence, as depicted in nucleotide #3440 through 
nucleotide #3550 of Table V and in nucleotide #36 through nucleotide #2225 of Table VI, respectively. 
These hBMP-1 polypeptides are further characterized by the ability to induce bone formation. hBMP-1 
35 demonstrates activity in an in vivo rat bone formation assay at a concentration of 10 to lOOOng/gram of 

bone. 

The homologous bovine growth factor of the invention, designated bBMP-1, is characterized by a 
peptide sequence containing the same or substantially the same sequence as that of amino acid #1 through 
amino acid #37 of Table II below which represents a genomic bBMP-1 fragment. This peptide sequence is 
40 encoded by the same or substantially the same DNA sequence as depicted in nucleotide #294 through 
nucleotide #404 of Table II. The bovine peptide sequence identified in Table II below is also 37 amino acids 
in length. bBMP-1 is further characterized by the ability to induce bone formation. 

Another bone inductive protein composition of the invention is designated BMP-2 Class I (or BMP-2). It 
is characterized by at least a portion of a peptide sequence the same or substantially the same as that, of 
45 amino acid #1 through amino acid #396 of Table VII which represents the cDNA hBMP-2 Class I. This 

peptide sequence is encoded by the same or substantially the same DNA sequence, as depicted in 
nucleotide #356 through nucleotide #1543 of Table VII. The human peptide sequence identified in Table VII 
is 396 amino acids in length. hBMP-2 or related bone inductive proteins may also be characterized by at 
least a portion of this peptide sequence. hBMP-2 Class I is further characterized by the ability to induce 
so bone formation. 

The homologous bovine bone inductive protein of the invention designated bBMP-2 Class I (or bBMP- 
2), has a DNA sequence identified in Table ill below which represents the genomic sequence This bovine 
DNA sequence has a prospective 129 amino acid coding sequence followed by approximately 205 
nucleotides (a presumptive 3' non-coding sequence). bBMP-2, Class I is further characterized by the ability 
55 to induce bone formation. A further bone inductive protein composition of the invention is designated BMP-2 
Class II or BMP-4. The human protein hBMP-2 Class II (or hBMP-4) is characterized by at least a portion of 
the same or substantially the same peptide sequence between amino acid #1 through amino acid #408 of 
Table VIII, which represents the cDNA of hBMP-2 Class II. This peptide sequence is encoded by at least a 
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portion of the same or substantially the same DNA sequence as depicted in nucleotide #403 through 
nucleotide #1626 of Table VIII. This factor is further characterized by the ability to induce bone formation. 

Still another bone inductive factor of the invention, BMP-3, is represented by the bovine homolog 
bBMP-3. bBMP-3 is characterized by the DNA sequence and amino acid sequence of Table IV A and B 
5 which represents the bovine genomic sequence. It is characterized by at least a portion of a peptide 
sequence the same or substantially the same as amino acid #1 through amino acid #175 of Table IV A and 
B. BMP-3 is further characterized by the ability to induce bone formation. The bovine factor may be 
employed as a tool for obtaining the analogous human BMP-3 protein or other mammalian bone inductive 
proteins. The proper characterization of this bovine bone inductive factor provides the essential "starting 
io point" for the method employing this sequence. The method, employing techniques known to those skilled 
in the art of genetic engineering, involves using the bovine DNA sequence as a probe to screen a human 
genomic or cDNA library; and identifying the DNA sequences which hybridize to the probes. A clone with a 
hybridizable sequence is plaque purified and the DNA isolated therefrom, subcloned and subjected to DNA 
sequence analysis. Thus another aspect of this invention is a human protein hBMP-3, produced by this 
15 method. 

Another aspect of the invention provides pharmaceutical compositions containing a therapeutically 
effective amount of one or more bone growth factor polypeptides according to the invention in a 
pharmaceutically acceptable vehicle. These compositions may further include other therapeutically useful 
agents. They may also include an appropriate matrix for delivering the proteins to the site of the bone 
20 defect and for providing a structure for bone growth. These compositions may be employed in methods for 
treating a number of bone defects and periodontal disease. These methods, according to the invention, 
entail administering to a patient needing such bone formation an effective amount of at least one of the 
novel proteins BMP-1, BMP-2 Class I, BMP-2 Class-ll, and BMP-3 as described herein. 

Still a further aspect of the invention are DNA sequences coding on expression for a human or bovine 
25 polypeptide having the ability to induce bone formation. Such sequences include the sequence of 
nucleotides in a 5’ to 3' direction illustrated in Tables II through VIII. Alternatively, a DNA sequence which 
hybridizes under stringent conditions with the DNA sequences of Tables II - VIII or a DNA sequence which 
hybridizes under non-stringent conditions with the illustrated DNA sequences and which codes on expres- 
sion for a protein having at least one bone growth factor biological property are included in the present 
30 invention. Finally, allelic or other variations of the sequences of Tables II through VIII, whether such 
nucleotide changes result in changes in the peptide sequence or not, are also included in the present 
invention. 

Still a further aspect of the invention is a vector containing a DNA sequence as described above in 
operative association with an expression control sequence. Such vector may be employed in a novel 
35 process for producing a bone growth factor polypeptide in which a cell line transformed with a DNA 
sequence encoding expression of a bone growth factor polypeptide in operative association with an 
expression control sequence therefor, is cultured. This claimed process may employ a number of known 
cells as host cells for expression of the polypeptide. Presently preferred cell lines are mammalian cell lines 
and bacterial cells. 

40 Other aspects and advantages of the present invention will be apparent upon consideration of the 
following detailed description and preferred embodiments thereof. 

Detailed Description of the Invention 

45 The proteins of the present invention are characterized by amino acid sequences or portions thereof the 
same as or substantially homologous to the sequences shown in Tables II - VIII below. These proteins are 
also characterized by the ability to induce bone formation. 

The bone growth factors provided herein also include factors encoded by the sequences similar to 
those of Tables II - VIII, but into which modifications are naturally provided (e.g. allelic variations in the 
so nucleotide sequence which may result in amino acid changes in the polypeptide) or deliberately en- 
gineered. For example, synthetic polypeptides may wholly or partially duplicate continuous sequences of 
the amino acid residues of Tables II - VIII. These sequences, by virtue of sharing primary, secondary, or 
tertiary structural and conformational characteristics with bone growth factor polypeptides of Tables II - VIII 
may possess bone growth factor biological properties in common therewith. Thus, they may be employed 
55 as biologically active substitutes for naturally-occurring bone growth factor polypeptides in therapeutic 
processes. 

Other specific mutations of the sequences of the bone growth factors described herein involve 
modifications of one or both of the glycosylation sites. The absence of glycosylation or only partial 
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glycosylation results from amino acid substitution or deletion at one or both of the asparagine-linked 
glycosylation recognition sites present in the sequences of the bone growth factors shown in Tables ll-VIII. 
The asparagine-linked glycosylation recognition sites comprise tripeptide sequences which are specifically 
recognized by appropriate cellular glycosylation enzymes. These tripeptide sequences are either as- 
5 paragine-X-threonine or asparagine-X-serine, where X is usually any amino acid. A variety of amino acid 
substitutions or deletions at one or both of the first or third amino acid positions of a glycosylation 
recognition site (and/or amino acid deletion at the second position) results in non-giycosylation at the 
modified tripeptide sequence. 

The present invention also encompasses the novel DNA sequences, free of association with DNA 
io sequences encoding other proteinaceous materials, and coding on expression for bone growth factors. 
These DNA sequences include those depicted in Tables II - VIII in a 5’ to 3‘ direction and those sequences 
which hybridize under stringent hybridization conditions [see, T. Maniatis et al, Molecular Cloning (A 
Laboratory Manual) , Cold Spring Harbor Laboratory (1982), pages 387 to 389] to the DNA sequences of 
Tables II - VIII. 

15 DNA sequences which hybridize to the sequences of Tables II - VIII under relaxed hybridization 
conditions and which code on expression for bone growth factors having bone growth factor biological 
properties also encode bone growth factors of the invention. For example, a DNA sequence which shares 
regions of significant homology, e.g., sites of glycosylation or disulfide linkages, with the sequences of 
Tables II - VIII and encodes a bone growth factor having one or more bone growth factor biological 
20 properties clearly encodes a member of this novel family of growth factors, even if such a DNA sequence 
would not stringently hybridize to the sequence of Tables II - VIII. 

Similarly, DNA sequences which code for bone growth factor polypeptides coded for by the sequences 
of Tables II - VIII, but which differ in codon sequence due to the degeneracies of the genetic code or allelic 
variations (naturally-occurring base changes in the species population which may or may not-result in an 
25 amino acid change) also encode the novel growth factors described herein. Variations in the DNA 
sequences of Tables II -VIII which are caused by point mutations or by induced modifications to enhance 
the activity, half-life or production of the polypeptides encoded thereby are also encompassed in the 
invention. 

Another aspect of the present invention provides a novel method for producing the novel osteoinductive 
30 factors. The method of the present invention involves culturing a suitable cell or cell line, which has been 
transformed with a DNA sequence coding on expression for a novel bone growth factor polypeptide of the 
invention, under the control of known regulatory sequences. Suitable cells or cell lines may be mammalian 
cells, such as Chinese hamster ovary cells (CHO). The selection of suitable mammalian host cells and 
methods for transformation, culture, amplification, screening and product production and purification are 
35 known in the art. See, e.g., Gething and Sambrook, Nature , 293:620-625 (1981), or alternatively, Kaufman et 
al, Mol, Cell. Biol. , 5(7):1 750-1 759 (1985) or Howley et al, U.S. Patent 4,419,446. Another suitable 
mammalian cell line, which is described in the accompanying examples, is the monkey COS-1 cell line. A 
similarly useful mammalian cell line is the CV-1 cell line. 

Bacterial cells are suitable hosts. For example, the various strains of E. coli (e.g., HB101, MC1061) are 
40 well-known as host cells in the field of biotechnology. Various strains of B. subtilis, Pseudomonas, other 
bacilli and the like may also be employed in this method. ’ 

Many strains of yeast cells known to those skilled in the art are also available as host cells for 
expression of the polypeptides of the present' invention. Additionally, where desired, insect cells may be 
utilized as host cells in the method of the present invention. See, e.g. Miller et al. Genetic Engineering, 
45 8:277-298 (Plenum Press 1986) and references cited therein. — — 

Another aspect of the present invention provides vectors for use in the method of expression of these 
novel osteoinductive polypeptides. Preferably the vectors contain the full novel DNA sequences described 
above which code for the novel factors of the invention. Additionally the vectors also contain appropriate 
expression control sequences permitting expression of the bone inductive protein sequences. Alternatively, 
so vectors incorporating modified sequences as described above are also embodiments of the present 
invention and useful in the production of the bone inductive proteins. The vectors may be employed in the 
method of transforming cell lines and contain selected regulatory sequences in operative association with 
the DNA coding sequences of the invention which are capable of directing the replication and expression 
thereof in selected host cells. Useful regulatory sequences for such vectors are known to one of skill in the 
55 art and may be selected depending upon the selected host cells. Such selection is routine and does not 
form part of the present invention. 

A protein of the present invention, which induces bone growth in circumstances where bone is not 
normally formed, has application in the healing of bone fractures. An osteogenic preparation employing one 
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or more of the proteins of the invention may have prophylactic use in closed as well as open fracture 
reduction and also in the improved fixation of artificial joints. De novo bone formation induced by an 
osteogenic agent contributes to the repair of congenital, trauma induced, or oncologic resection induced 
craniofacial defects, and also is useful in cosmetic plastic surgery. An osteogenic factor of the invention 
5 may be valuable in the treatment of periodontal disease, and in other tooth repair processes. Such agents 
may provide an environment to attract bone-forming cells, stimulate growth of bone-forming cells or induce 
differentiation of progenitors of bone-forming cells. Of course, the proteins of the invention may have other 
therapeutic uses. 

A further aspect of the invention is a therapeutic method and composition for repairing fractures and 
io other conditions related to bone defects or periodontal diseases. Such a composition comprises a 
therapeutically effective amount of at least one of the bone inductive factor proteins of the invention. The 
bone inductive factors according to the present invention may be present in a therapeutic composition in 
admixture with a pharmaceutically acceptable vehicle or matrix. Further therapeutic methods and composi- 
tions of the invention comprise a therapeutic amount of a bone inductive factor of the invention with a 

15 therapeutic amount of at least one of the other bone inductive factors of the invention. Additionally, the 

proteins according to the present invention or a combination of the proteins of the present invention may be 
co-administered with one or more different osteoinductive factors with which it may interact. Further, the 
bone inductive proteins may be combined with other agents beneficial to the treatment of the bone defect in 
question. Such agents include, but are not limited to various growth factors. The preparation of such 
20 physiologically acceptable protein compositions, having due regard to pH, isotonicity, stability and the like, 
is within the skill of the art. 

In particular, BMP-1 may be used individually in a composition. BMP-1 may also be used in 
combination with one or more of the other proteins of the invention. BMP-1 and BMP-2 Class I may be 
used in combination. BMP-1 and BMP-2 Class II may also be used in combination. BMP-1 and BMP-3 may 
25 be used in combination. Furthermore, BMP-1 may be used in combination with two or three of the other 

proteins of the invention. For example, BMP-1, BMP-2 Class I, and BMP-2 Class II may be combined. BMP- 

1 may also be combined with BMP-2 Class I, and BMP-3. Further, BMP-1 may be combined with BMP-2 
Class II, and BMP-3. BMP-1, BMP-2 Class I, BMP-2 Class II, and BMP-3 may be combined. 

BMP-2 Class I may be used individually in a pharmaceutical composition. BMP-2 Class I may also be 
30 used in combination with one or more of the other proteins of the invention. BMP-2 Class I may be 
combined with BMP-2 Class II. It may also be combined with BMP-3. Further BMP-2 Class I. may be 
combined with BMP-2 Class II and BMP-3. 

BMP-2 Class II may be used individually in pharmaceutical composition. In addition, it may be used in 
combination with other proteins as identified above. Further it may be used in combination with BMP-3. 

35 BMP-3 may be used individually in a composition. It may further be used in the various combinations 
identified above. 

The therapeutic method includes locally adnhinistering the composition as an implant or device. When 
administered, the therapeutic composition for use in this invention is, of course, in a pyrogen-free, 
physiologically acceptable form. Further, the composition may desirably be encapsulated or injected in a 
40 viscous form for delivery to the site of bone damage. Preferably, the bone growth inductive factor 
composition would include a matrix capable of delivering the bone inductive factor to the site of bone 
damage, providing a structure for the developing bone and cartilage and optimally capable of being 
resorbed into the body. Such matrices may be formed of other materials presently in use for other 
implanted medical applications. 

45 The choice of material is based on, for example, biocompatibility, biodegradability, mechanical prop- 
erties, cosmetic appearance and interface properties. Similarly, the application of the osteoinductive factors 
will define the appropriate formulation. Potential matrices for the osteoinductive factors may be biodeg- 
radable and chemically defined, such as, but not limited to calcium sulfate, tricalciumphosphate, hydrox- 
yapatite, polylactic acid, polyanhydrides; biodegradable and biologically well defined, such as bone or 
so dermal collagen, other pure proteins or extracellular matrix components; nonbiodegradable and chemically 
defined, such as sintered hydroxyapatite, bioglass, aluminates, or other ceramics; or combinations of any of 
the above mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalciumphosphate. The bioceramics might also be altered in composition, such as in calcium-aluminate- 
phosphate and processing to alter for example, pore size, particle size, particle shape, and biodegradability. 
55 The dosage regimen will be determined by the attending physician considering various factors which 
modify the action of such a growth factor, e.g. amount of bone weight desired to be formed, the site of bone 
damage, the condition of the damaged bone, the patient’s age, sex, and diet, the severity of any infection, 
time of administration and other clinical factors. The dosage may vary with the type of matrix used in the 
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reconstitution and the composition of BMP’s. The addition of other known growth factors, such as IGF 1 
(insulin like growth factor 1), to the final composition, may also effect the dosage. Generally, the dosage 
regimen should be in the range of approximately 10 to 10 6 nanograms of protein per gram of bone weight 
desired. Progress can be monitored by periodic assessment of bone growth and/or repair, e.g. x-rays. Such 
5 therapeutic compositions are also presently valuable for veterinary applications due to the lack of species 
specificity in bone inductive factors. Particularly domestic animals and thoroughbred horses in addition to 
humans are desired patients for such treatment with the bone inductive factors of the present invention. 

The following examples illustrate practice of the present invention in recovering and characterizing the 
bovine proteins and employing them to recover the human proteins, obtaining the human proteins and in 
io expressing the proteins via recombinant techniques. 

EXAMPLE I 

Isolation of Bovine Bone Inductive Factor 

15 

Ground bovine bone powder (20-120 mesh, Helitrex) is prepared according to the procedures of M. R. 
Uristet al., Proc. Natl Acad. Sci USA , 70:3511 (1973) with elimination of some extraction steps as identified 
below. Ten kgs of the ground powder is demineralized in successive changes of 0.6N HCI at 4*C over a 48 
hour period with vigorous stirring. The resulting suspension is extracted for 16 hours at 4*C with 50 liters of 
20 2M CaCb and 10mM ethylenediamine-tetraacetic acid [EDTA], and followed by extraction for 4 hours in 50 
liters of 0.5M EDTA. The residue is washed three times with distilled water before its resuspension in 20 
liters of 4M guanidine hydrochloride [GuCI], 20mM Tris (pH 7.4), ImM N-ethylmaleimide, ImM 
iodoacetamide, ImM phenylmethylsulfonyl fluorine as described in Clin. Orthop. Rel. Res. , 171: 213 (1982). 
After 16 to 20 hours the supernatant is removed and replaced with another 10 liters of GuCI buffer. The 
25 residue is extracted for another 24 hours. 

The crude GuCI extracts are combined, concentrated approximately 20 times on a Pellicon apparatus 
with a 10,000 molecular weight cut-off membrane; and then dialyzed in 50mM Tris, 0.1 M NaCI, 6M urea 
(pH7.2), the starting buffer for the first column. After extensive dialysis the protein is loaded on a 4 liter 
DEAF cellulose column and the unbound fractions are collected. 

30 The unbound fractions are concentrated and dialyzed against 50mM NaAc, 50mM NaCI (pH 4.6) in 6M 
urea. The unbound fractions are applied to a carboxymethyl cellulose column. Protein not bound to the 
column is removed by extensive washing with starting buffer, and the bone inductive factor containing 
material desorbed from the column by 50mM NaAc, 0.25mM NaCI, 6M urea (pH 4.6). The protein from this 
step elution is concentrated 20- to 40- fold, then diluted 5 times with 80mM KP0 4 , 6M urea (pH6.0). The pH 
35 of the solution is adjusted to 6.0 with 500mM foHPO*. The sample is applied to an hydroxylapatite column 
(LKB) equilibrated in 80mM KP0 4 , 6M urea (pH6.0) and all unbound protein is removed by washing the 
column with the same buffer. Bone inductive factor activity is eluted with lOOmM KP0 4 (pH7.4) and 6M 
urea. 

The protein is concentrated approximately 10 times, and solid NaCI added to a final concentration of 
40 0.1 5M. This material is applied to a heparin - Sepharose column equilibrated in 50mM KP0 4 , 150mM NaCI, 

6M urea (pH7.4). After extensive washing of the column with starting buffer, a protein with bone inductive 
factors activity is eluted by 50mM KP0 4 , 700mM NaCI, 6M urea (pH7.4). This fraction is concentrated to a 
minimum volume, and 0.4ml aliquots are applied to Superose 6 and Superose 12 columns connected in 
series, equilibrated with 4M GuCI, 20mM Tris (pH7.2) and the columns developed at a flow rate of 
45 0.25ml/min. The protein demonstrating bone inductive factor activity has a relative migration corresponding 

to approximately 30,000 dalton protein. 

The above fractions are pooled, dialyzed against 50mM NaAc, 6M urea (pH4.6), and applied to a 
Pharmacia MonoS HR column. The column is developed with a gradient to 1.0M NaCI, 50mM NaAc, 6M 
urea (pH4.6). Active fractions are pooled and brought to pH3.0 with 10% trifluoroacetic acid (TFA). The 
so material is applied to a 0.46 x 25cm Vydac C4 column in 0.1% TFA and the column developed with a 
gradient to 90% acetonitrile, 0.1% TFA (31.5% acetonitrile, 0.1% TFA to 49.5% acetonitrile, 0.1% TFA in 60 
minutes at 1ml per minute). Active material is eluted at approximately 40-44% acetonitrile. Aliquots of the 
appropriate fractions are iodinated by one of the following methods: P. J. McConahey et al, Int. Arch. 
Allergy , 29:185-189 (1966); A. E. Bolton et al, Biochem J „ 133:529 (1973); and D. F. Bowen-Pope, J. Biol. 
55 Chem. , 237:5161 (1982). The iodinated proteins present in these fractions are analyzed by SDS gel 
electrophoresis and urea Triton X 100 isoelectric focusing. At this stage, the bone inductive factor is 
estimated to be approximately 10-50% pure. 
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EXAMPLE II 

Characterization of Bovine Bone Inductive Factor 
5 A. Molecular Weight 

Approximately 20ug protein from Example I is lyophilized and redissolved in IX SDS sample buffer. 
After 15 minutes of heating at 37 *C, the sample is applied to a 15% SDS polyacrylamide gel and then 
electrophoresed with cooling. The molecular weight is determined relative to prestained molecular weight 
io standards (Bethesda Research Labs). Immediately after completion, the gel lane containing bone inductive 
factor is sliced into 0.3cm pieces. Each piece is mashed and 1.4ml of 0.1% SDS is added. The samples are 
shaken gently overnight at room temperature to elute the protein. Each gel slice is desalted to prevent 
interference in the biological assay. The supernatant from each sample is acidified to pH 3.0 with 10% TFA, 
filtered through a 0.45 micron membrane and loaded on a 0.46cm x 5cm C4 Vydac column developed with 
75 a gradient of 0.1% TFA to 0.1% TFA, 90% CH3CN. The appropriate bone inductive factor - containing 
fractions are pooled and reconstituted with 20mg rat matrix. In this gel system, the majority of bone 
inductive factor fractions have the mobility of a protein having a molecular weight of approximately 28,000 - 
30,000 daltons. 

20 B. Isoelectric Focusing 

The isoelectric point of bone inductive factor activity is determined in a denaturing isoelectric focusing 
system. The Triton X100 urea gel system (Hoeffer Scientific) is modified as follows: 1) 40% of the 
ampholytes used are Servalyte 3/10; 60% are Servalyte 7-9. 2) The catholyte used is 40mM NaOH. 
25 Approximately 20ug of protein from Example I is lyophilized, dissolved in sample buffer and applied to the 
isoelectrofocusing gel. The gel is run at 20 watts, 10 *C for approximately 3 hours. At completion the lane 
containing bone inductive factor is sliced into 0.5 cm slices. Each piece is mashed in 1.0ml 6M urea, 5mM 
Tris (pH 7.8) and the samples agitated at room temperature. The samples are acidified, filtered, desalted 
and assayed as described above. The major portion of activity as determined in the assay described in 
30 Example III migrates in a manner consistent with a pi of 8.8 - 9.2. 

C. Subunit Characterization 

The subunit composition of bone inductive factor is also determined. Pure bone inductive factor is 
35 isolated from a preparative 15% SDS gel as described above. A portion of the sample is then reduced with 
5mM DTT in sample buffer and re-efectrophoresed on a 15% SDS gel. The approximately 30kd protein 
yields two major bands at approximately 20kd and 18kd, as well as a minor band at 30kd. The broadness of 
the two bands indicates heterogeneity caused most probably by glycosylation, other post translational 
modification, proteolytic degradation or carbamylation. 

40 

EXAMPLE III 

Biological Activity of Bone Inductive Factor 

45 A rat bone formation assay according to the general procedure of Sampath and Reddi, Proc. Natl. Acad. 
Sci. U.S.A. , 80:6591-6595 (1983) is used to evaluate the osteogenic activity of the bovine bone inductive 
factor of the present invention obtained in Example I. This assay can also be used to evaluate bone 
inductive factors of other species. The ethanol precipitation step is replaced by dialyzing the fraction to be 
assayed against water. The solution or suspension is then redissolved in a volatile solvent, e.g. 0.1 - 0.2 % 
so TFA, and the resulting solution added to 20mg of rat matrix. This material is frozen and lyophilized and the 
resulting powder enclosed in #5 gelatin capsules. The capsules are implanted subcutaneously in the 
abdominal thoracic area of 21 - 49 day old male long Evans rats. The implants are removed after 7-14 
days. Half of each implant is used for alkaline phosphatase analysis [See, A. H. Reddi et al., Proc. Natl 
Acad Sci. , 69:1601 (1972)] and half is fixed and processed for histological analysis. Routinely, lum 
55 glycolmethacrylate sections are stained with Von Kossa and acid fuschin to detect new bone mineral. 
Alkaline phosphatase, an enzyme produced by chondroblasts and osteoblasts in the process of matrix 
formation, is also measured. New cartilage and bone formation often correlates with alkaline phosphatase 
levels. Table I below illustrates the dose response of the rat matrix samples including a control not treated 
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with bone inductive factor. 



TABLE 1 

5 



10 



*At this stage the bone inductive factor is approximately 10-15% pure. 



75 The bone or cartilage formed is physically confined to the space occupied by the matrix. Samples are 
also analyzed by SDS gel electrophoresis and isoelectric focusing as described above, followed by 
autoradiography. Analysis reveals a correlation of activity with protein bands at 28 - 30kd and a pi 9.0. An 
extinction coefficient of 1 OD/mg-cm is used as an estimate for protein and approximating the purity of 
bone inductive factor in a particular fraction. In the in vivo rat bone formation assays on dilutions as 
20 described above, the protein is active in vivo at 10 to 200ng protein/gram bone to probably greater than lug 
protein/gram bone. 

EXAMPLE IV 

25 Bovine Bone Inductive Factor Protein Composition 

The protein composition of Example IIA of molecular weight 28 - 30kd is reduced as described in 
Example IIC and digested with trypsin. Eight tryptic fragments are isolated by standard procedures having 
the following amino acid sequences: 

30 Fragment 1: AAFL6DI ALDEED LG 

Fragment 2: A F Q V Q Q A A D L 
Fragment 3:NYQDMVVEG 
Fragment 4:STPAQDVSR 
Fragment 5: N Q E A L R 
35 Fragment 6:LSEPDPSHTLEE 

Fragment 7: F D A Y Y 
Fragment 8:LKPSN?ATIQSIVE 

A less highly purified preparation of protein from bovine bone is prepared according to a purification 
scheme similar to that described in Example I. The purification, basically varies from that previously 
40 described by omission of the DE-52 column, the CM cellulose column and the mono S column, as well as a 
reversal in the order of the hydroxylapatite and heparin sepharose columns. Briefly, the concentrted crude 4 
M extract is brought to 85% final concentration of ethanol at 4 degrees. The mixture is then centrifuged, 
and the precipitate redissolved in 50 mM Tris, 0.15 M NaCI, 6.0 M urea. This material is then fractionated 
on Heparin Sepharose as described. The Heparin bound material is fractionated on hydroxyapatite as 
45 described. The active fractions are pooled, concentrated, and fractionated on a high resolution gel filtration 
(TSK 30000 in 6 M guanidinium chloride, 50 mM Tris, pH 7.2). The active fractions are pooled, dialyzed 
against 0.1% TFA, and then fractionated on a C4 Vydac reverse phase column as described. The 
preparation is reduced and electrophoresed on an acrylamide gel. The protein corresponding to the 18K 
band is eluted and digested with trypsin. Tryptic fragments are isolated having the following amino acid 
so sequences: 

Fragment 9:SLKPSNHATIQS?V 
Fragment 10: S FD AYYC S? A 
Fragment 11:VY PNMTVESCA 
Fragment 12: V D F A D I ? W 

55 Tryptic Fragments 7 and 8 are noted to be substantially the same as Fragments 10 and 9, respectively. 



Protein* Implanted ug 


Cartilage 


Aik. Phos.u/1 


7.5 


2 


Not done 


2.5 


3 


445.7 


0.83 


3 


77.4 


0.28 


0 


32.5 


0.00 


0 


31.0 
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A. bBMP-1 



Probes consisting of pools of oligonucleotides (or unique oligonucleotides) are designed according to 
the method of R. Lathe, J. Mol. Biol ., 183 (1):1-12 (1985) and synthesized on an automated DNA 
5 synthesizer. One probe consists of a relatively Jong (32 nucleotides) "guessmer" [See J. J. Toole et al, 
Nature , 312:342-347 (1984)] of the following nucleotide sequence: 
TCCTCATCCAGGGCAATGTCGCCCAGGAAGGC 

Because the genetic code is degenerate (more than one codon can code for the same amino acid), the 
number of oligonucleotides in a probe pool is reduced based on the frequency of codon usage in 
io eukaryotes, the relative stability of G:T base pairs, and the relative infrequency of the dinucleotide CpG in 
eukaryotic coding sequences [see Toole et al. p supra.]. The second set of probes consists of shorter 
oligonucleotides (17 nucleotides in length) which contain all possible sequences that could encode the 
amino acids. The second set of probes has the following sequences: 

(a) A [A/G] [A/G] TC [T/C] TC [T/C] TC [A/G] TC [T/C] AA 
75 (b) A [A/G] [A/G] TC [T/C] TC [T/C] TC [A/G] TCNAG 

Bracketed nucleotides are alternatives. "N" means either A, T, C or G. 

In both cases the regions of the amino acid sequence used for probe design are chosen by avoiding 
highly degenerate codons where possible. The oligonucleotides are synthesized on an automated DNA 
synthesizer; the probes are then radioactively labeled with polynucleotide kinase and ^P-ATP. 

20 These two sets of probes are used to screen a bovine genomic recombinant library. The library is 
constructed as follows: Bovine liver DNA is partially digested with the restriction endonuclease enzyme Sau 
3A and sedimented through a sucrose gradient. Size fractionated DNA in the range of 15-30kb is then 
ligated to the bacteriophage Bam HI vector EMBL3 [Frischauf et al, J. Mol. Biol. , 170:827-842 (1983)]. The 
library is plated at 8000 recombinants per plate. Duplicate nitrocellulose replicas of the plaques are made 
25 and amplified according to a modification of the procedure of Woo et al, Proc. Natl. Acad. Sci. USA, 
75:3688-91 (1978). ’ 

The 32 mer probe is kinased with 32 P-gamma-ATP and hybridized to one set of filters in 5X SSC, 0.1% 
SDS, 5X Denhardts, lOOug/ml salmon sperm DNA at 45 degrees C and washed with 5X SSC, 0.1% SDS at 
45 degrees C. The 17 mer probes are kinased and hybridized to the other set of filters in 3M 
30 tetramethylammonium chloride (TMAC), 0.1 M sodium phosphate pH6.5, ImM EDTA, 5X Denhardts, 0.6% 
SDS, lOOug/ml salmon sperm DNA at 48 degrees C, and washed in 3M TMAC, 50mM Tris pH8.0 at 50 
degrees C. These conditions minimize the detection of mismatches to the 17 mer probe pool [see, Wood et 
Proc. Natl. Acad. Sci, U.S.A., 82:1585-1588 (1985)]. 400,000 recombinants are screened by this 
procedure and one duplicate positive is plaque purified. DNA is isolated from a plate lysate of this 
35 recombinant bacteriophage designated lambda bP-50. bP-50. was deposited December 16, 1986 with the 
American Type Culture Collection, 12301 Parklawn Drive, Rockville, Maryland USA (hereinafter the "ATCC") 
under accession number 40295. This deposit as well as the other deposits contained herein meets the 
requirements of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for 
the Purposes of Patent Procedure and Regulations thereunder. This bp-50 clone encodes at least a portion 
40 of the bovine bone growth factor designated bBMP-1. 

The oligonucleotide hybridizing region of this bBMP-1 clone is localized to an approximately 800bp Eco 
Rl fragment which is subcloned into M13 and sequenced by standard techniques. The partial DNA 
sequence and derived amino acid sequence of lambda bP-50 are shown below in Table II. The amino acid 
sequences corresponding to the tryptic fragments isolated from the bovine bone 28 to 30kd material are 
45 underlined in Table II. The first underlined portion of the sequence corresponds to tryptic Fragment 1 above 
from which the oligonucleotide probes are designed. The second underlined portion corresponds to tryptic 
Fragment 2 above. The predicted amino acid sequence indicates that tryptic Fragment 2 is preceded by a 
basic residue (R) as expected considering the specificity of trypsin. The nucleic acid sequence preceding 
the couplet CT at nucleotide positions #292-293 in Table II is presumed to be an intron (noncoding 
so sequence) based on the presence of a consensus acceptor sequence (i.e., a pyrimidine rich tract, 
TCTCTCTCC, followed by AG) and the lack of a basic residue in the appropriate position of the derived 
amino acid sequence. This bBMP-1 genomic sequence appears in Table II. The presumptive bBMP-1 
peptide sequence from this genomic clone is 37 amino acids in length and is encoded by the DNA 
sequence from nucleotide #294 through #404 in Table II. 

55 
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TABLE II 

280 . 290 (1) 308 323 

Cui'iGCCTCT TCTCTCTCCA GCT GCC TTC CIT GGG GAC ATC GCC CIG GAC GAG GAG 
5 Ala Fhe Leu Glv Asp lie Ala Leu Asa Glu Glu 

338 353 368 

GAC TIG AGG GCC TTC CAA GIG CAG CAG GCT GOG GAC CTC AGA CAG OGT GCA ACC 

^ Arg Ala Phe Gin Val Gin Gin Ala Ala Asp Tph Arg Gin Arg Ala Thr 

° 383 398 (37) 414 424 

CGC AGG TCT TCC ATC AAA GCT GCA GCTACACIGG GLACAGGCCA 

Arg Arg Ser Ser lie Iys Ala Ala 



B. bBMP-2 

Two probes consisting of pools of oligonucleotides are designed on the basis of the amino acid 
20 sequence of Fragment 3 and synthesized on an automated DNA synthesizer as described above. 

Probe #1:ACNACCAT [A/G] T C [T/C] T G [A/G] A T 
Probe #2: C A [A/G] G A [T/C] ATGGTNGTNGA 
These probes are radioactively labeled and employed to screen the bovine genomic library constructed as 
described in part A except that the vector is lambda J1 Bam HI arms [Mullins et al Nature 308: 856-858 

25 (1984).] The radioactively labelled 17-mer Probe #1 is hybridized to the set of filters according to the 

method for the 17 mer probe described in part A. 

400,000 recombinants are screened by the procedure described above in Part A. One duplicate positive 
is plaque purified and the DNA is isolated from a plate lysate of the recombinant bacteriophage designated 
lambda bP-21. Bacteriophage bP-21 was deposited with the ATCC under accession number ATCC 40310 
30 on March 6, 1987. The bP-21 clone encodes the bovine growth factor designated bBMP-2. 

The oligonucleotide hybridizing region of this bBMP-2 clone is localized to an approximately 1.2 kb Sac 
I restriction fragment which is subcloned into Ml 3 and sequenced by standard techniques. The partial DNA 
sequence and derived amino acid sequence of this Sac I fragment and the contiguous Hind Ill-Sac I 
restriction fragment of bP-21 are shown below in Table III. The bBMP-2 peptide sequence from this clone is 
35 129 amino acids in length and is encoded by the DNA sequence from nucleotide #1 through nucleotide 

#387. The amino acid sequence corresponding to the tryptic fragment isolated from the bovine bone 28 to 
30kd material is underlined in Table ill. The underlined portion of the sequence corresponds to tryptic 
Fragment 3 above from which the oligonucleotide probes for bBMP-2 are designed. The predicted amino 
acid sequence indicates that tryptic Fragment 3 is preceded by a basic residue (K) as expected considering 
40 the specificity of trypsin. The arginine residue encoded by the CGT triplet is presumed to be the carboxy- 
terminus of the protein based on the presence of a stop codon (TAG) adjacent to it. 
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TABLE III 



(1) 

GGC 


CAC 


GAT 


GGG 


15 

AAA 


GGA 


CAC 


CCT 


CTC 


30 

CAC 


AGA 


AGA 


GAA 


AAG 


45 

CGG 


G 


H 


D 


G 


K 


G 


H 


P 


L 


H 


R 


R 


E 


K 


R 


CAA 


GCA 


AAA 


CAC 


60 

AAA 


CAG 


CGG 


AAA 


CGC 


75 

CTC 


AAG 


TCC 


AGC 


TGT 


90 

AAG 


Q 


A 


K 


H ’ 


K 


Q 


R 


K 


R 


L 


K 


S 


S 


C 


K 


AGA 


CAC 


CCT 


TTA 


105 

TAT 


GTG 


GAC 


TTC 


AGT 


120 

GAT 


GTG 


GGG 


TGG 


AAT 


135 

GAC 


R 


H 


P 


L 


Y 


V 


D 


F 


S 


D 


V 


G 


W 


N 


D 


TGG 


ATC 


GTT 


GCA 


150 

CCG 


CCG 


GGG 


TAT 


CAT 


165 

GCC 


TTT 


TAC 


TGC 


CAT 


180 

GGG 


W 


I 


V 


A 


P 


P 


G 


Y 


H 


A 


F 


Y 


C 


H 


G 


GAG 


TGC 


CCT 


TTT 


195 

CCC 


CTG. 


GCC 


GAT 


CAC 


210 

CTT 


AAC 


TCC 


ACG 


AAT 


225 

CAT 


E 


C 


P 


F 


p 


L 


A 


D 


H 


L 


N 


S 


T 


N 


H 


GCC 


ATT 


CTC 


CAA 


240 

ACT 


CTG 


GTC 


AAC 


TCA 


255 

GTT 


AAC 


TCT 


AAG 


ATT 


270 

CCC 


A 


I 


V 


Q 


T 


L 


V 


N 


S 


V 


N 


S 


K 


I 


P 


AAG 


GCA 


TGC 


TGT 


385 

GTC 


CCA 


ACA 


GAG 


CTC 


300 

AGC 


GCC 


ATC 


TCC 


ATG 


315 

CTG 


K 


A 


C 


C 


V 


P 


T 


E 


L 


S 


A 


■I 


S 


M 


L 


TAC 


CTT 


GAT 


GAG 


330 

AAT 


GAG 


AAG 


GTG 


GTA 


345 

TTA 


AAG 


AAC 


TAT 


CAG 


360 

GAC 


Y 


L 


D 


E 


N 


E 


K 


V 


V 


L 


K 


N 


Y 


_Q 


D 


ATG 


GTT 


GTC 


GAG 


375 

GGT 


TGT 


GGG 


TGT 


(129) 397 

CGT TAGCACAGCA . 


407 

AAATAAAATA 


M 


V 


V 


E 


G 


C 


G 


C 


R 















417 427 437 447 457 

40 TAAATATATA TATATATATA TTAGAAAAAC AGCAAAAAAA TCAAGTTGAC 

467 477 487 497 507 

ACTTTAATAT TTCCCAATGA AGACTTTATT TATGGAATGG AATG GAG AAA 

45 517 527 537 547 557 

AAGAAAAACA CAGCTATTTT GAAAACTATA TTTATATCTA CCGAAAAGAA 



567 577 587 

GTTGGGAAAA CAAATATTTT AATCAGAGAA TTATT 

50 



C. bBMP-3 



55 Probes consisting of pools of oligonucleotides are designed on the basis of the amino acid sequences 
of the tryptic Fragments 9 (Probe #3), 10 (Probe #2), and 11 (Probe #1), and synthesized on an automated 
DNA synthesizer. 

Probe #1:ACNGT CAT [A/G] T T N G G [A/G] T A 
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Probe #2: C A [A/G] T A [A/G] T A N G C [A/G] T C [A/G] A A 
Probe #3: T G [A/G/T] ATNGTNGC [A/G] T G [A/G] T T 

A recombinant bovine genomic library constructed in EMBL3 is screened by the TMAC hybridization 
procedure detailed above in part A. 400,000 recombinants are screened in duplicate with Probe #1 which 
5 has been labeled with 32 P. All recombinants which hybridized to this probe are replated for secondaries. 
Triplicate nitrocellulose replicas are made of the secondary plates, and amplified as described. The three 
sets of filters are hybridized to Probes #1 , # 2 and #3, again under TMAC conditions. One clone, lambda bP- 
81 9, . hybridizes to all three probes and is plaque purified and DNA is isolated from a plate lysate. * 
Bacteriophage lambda bP-819 was deposited with the ATCC on June 16, 1987 under accession number 
io 40344. This bP-819 clone encodes the bovine bone growth factor designated bBMP-3. 

The region of bP-819 which hybridizes to Probe #2 is localized and sequenced. The partial DNA and 
derived amino acid sequences of this region are shown in Table IVA. The amino acid sequences 
corresponding to tryptic Fragments 10 and 12 are underlined. The first underlined sequence corresponds to 
Fragment 12 while the second corresponds to Fragment 10. This region of bP-819, therefore, which 
is hybridizes to Probe #2 encodes at least III amino acids. This amino acid sequence is encoded by the DNA 
sequence from nucleotide #414 through #746. 

TABLE IV. A. 



20 



25 



30 



35 



40 



45 



383 393 403 413 (1) 428 

GAGGAGGAAG OGGTCIACGG GGCTCCITCr GCCTCTGCAG AAC AAT GAG CIT CCT GGG GCA 

Asn Asn Glu Leu Pro Gly Ala 

443 458 473 488 

GAA TAT CAG TAC AAG GAG GAT GAA GTA TCG GAG GAG AGG AAG CCT TAC AAG ACT 

Glu Tyr Gin Tyr Lys Glu Asp Glu Val Trp Glu Glu Arg Lys Pro Tyr Lys Thr 

503 518 533 

CIT CAG ACT CAG COC CCT GAT AAG ACT AAG AAC AAA AAG AAA CAG AGG AAG GGA 

Ihu Gin Hir Gin Pro Pro Asp Lys Ser Lys Asn Lys Lys Lys Gin Arg Lys Gly 

548 563 578 593 

CCT CAG CAG AAG ACT CAG AGG CIC CAG TTT GAT GAA CAG ACC CIG AAG AAG GCA 

Pro Gin Gin Lys Ser Gin Thr Leu Gin Phe Asp Glu Gin Thr Leu Lys Lys Ala 

608 623 638 

AGA AGA AAG CAA TGG AIT GAA COC OGG AAT TCT GOC AGA CGG TAC CIT AAA GIG 

Arg Arg lys Gin Trp lie Glu Pro Arg Asn Cys Ala Arg Arg Tyr Leu Lys Val 

653 668 683 698 

GAC TIC GCA GAT AIT GGC TGG AGC GAA TGG AIT AIT TCC CCC AAG ICC TIC GAT 

Asp Phe Ala Asp H e Gly Trp Ser Glu Trp lie He Ser Pro Lys Ser Phe Asp 

713 728 743 (111) 756 

GCC TAT TAC TGC TCC GGA GOG TGC CAG TIC OCC ATG CCA AAG GTAGCCAITC 

Ala Tyr Tyr Cys Ser Gly Ala Cys Gin Phe Pro MET Pro Lys 

766 776 786 

TTTTTICTCC TGTCCITCCC ATTTCCATAG 



50 

The region of bP-819 which hybridizes to Probe #1 and #3 is localized and sequenced. The partial DNA 
and derived amino acid sequences of this region are shown in Table IVB. The amino acid sequences 
corresponding to tryptic Fragments 9 and 11 are underlined. The first underlined sequence corresponds to 
Fragment 9 while the second underlined sequence corresponds to Fragment 1 1 . The peptide sequence of 
55 this region of bP-819 which hybridizes to Probe #1 and #3 is 64 amino acids in length encoded by 
nucleotide #305 through #493 of Table IVB. The arginine residue encoded by the AGA triplet is presumed 
to be the carboxy-terminus of the protein based on the presence of a stop codon (TAA) adjacent to it. The 
nucleic acid sequence preceding the couplet TC (positions 305-306) is presumed to be an intron (non- 
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coding sequence) based on the presence of a consensus acceptor sequence (i.e. a pyrimidine-rich stretch, 
TTCTCCC I I 1 1CGTTCCT, followed by AG) and the presence of a stop rather than a basic residue in the 
appropriate position of the derived amino acid sequence. 

bBMP-3 is therefore characterized by the DNA and amino acid sequence of Table IV A and Table IV B. 

5 The peptide sequence of this clone is 175 amino acids in length and is encoded by the DNA sequence 
from nucleotide #414 through nucleotide #746 of Table IV A and nucleotide #305 through nucleotide #493 
of Table IV B. 



70 



TABLE IV. B. 



284 294 304 (112 ) 319 

CTAACCIGIG TTCTCCCTIT TCCTICCIAG. TCT TIG AAG CCA TCA AAT CAC GCT ACC 

75 Ser Leu Lvs Pro Ser Asrt His Ala Thr 

334 349 364 379 

ATC CAG ACT ATA GIG AGA GCT GIG GGG CTC GTC CCT GGA ATC CCC GAG CCT TGC 

' He Gin Ser He Val Arg Ala Val Gly Val Val Pro Gly lie Pro Glu Pro cys 

20 

394 409 424 439 

TGT GIG CCA GAA AAG ATG TOC TCA CTC AGC ATC TTA TTC TIT GAT GAA AAC AAG 

Cys Val Pro Glu Lys MET Ser Ser Leu Ser . He Leu Rie Fhe Asp Glu Asn lys 

as 454 469 484 (175) 

AAT GIG G3A CIT AAA GTA TAT CCA AAC ATG ACA GIA GAG TCT TGT GCT TGC AGA 

Asn Val Val Lsu Lys Val Tyr Pro Asn MET Thr Val Glu Ser Cvs Ala cys Arg 

503 513 523 533 

TAAOCIGGIG AAGAACTCAT CIGGATGCIT AACTCAATCG 



EXAMPLE V 

35 

Human Bone Inductive Factors 
A. hBMP-1 

40 Because the bovine and human bone growth factor genes are presumed to be significantly homologous, 
the bovine bBMP-1 DNA sequence of Table II (or portions thereof) is used as a probe to screen a human 
genomic library. The 800bp EcoRI fragment of the bovine genomic clone is labeled with 32 P by nick- 
translation. A human genomic library (Toole et al., supra ) is plated on 20 plates at 40,000 recombinants per 
plate. Duplicate nitrocellulose filter replicas are made of each plate and hybridized to the nick-translated 
45 probe in 5 X SSC, 5 X Denhardt’s, lOOug/ml denatured salmon sperm DNA, 0.1% SDS (the standard 
hybridization solution) at 50 degrees centigrade for approximately 14 hours. The filters are then washed in 1 
X SSC, 0.1% SDS at 50 degrees centigrade and subjected to autoradiography. Five duplicate positives are 
isolated and plaque purified. DNA is obtained from a plate lysate of one of these recombinant bac- 
teriophage, designated LP-H1. LP-H1 was deposited with the ATCC on March 6, 1987 under accession 
so number 40311. This clone encodes at least a portion of the humane genomic bone growth factor called 
hBMP-1. The hybridizing region of LP-H1 is localized to a 2.5kb Xbal/Hindlll restriction fragment. 

The partial DNA sequence and derived amino acid sequence of lambda LP-H1 are shown below in 
Table V. The peptide sequence from this clone is 37 amino acids in length and is encoded by the DNA 
sequence from nucleotide #3440 through nucleotide #3550. The coding sequence of Table V is flanked by 
55 approximately 28 nucleotides (a presumptive 5' noncoding sequence) as well as approximately 19 
nucleotides (a presumptive 3’ noncoding sequence. A comparison of the bBMP-1 sequence of Table II with 
the hBMP-1 genomic sequence of Table V indicates the significant homology between the two. 
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Because the size of coding regions and the positions of noncoding regions is generally conserved in 
homologous genes of different species, the locations of the coding and noncoding regions of the bone 
inductive factor genes may be identified. Regions of homology between the two species’ genes, flanked by 
RNA processing signals at homologous sites, indicate a coding region. 



TABLE V 

3419 3429 3439 (1) 3454 

10 CAGCCCTGGC TICITCTITT CTCITIAGCr GCC TIT CTT GGG GAC A IT GCC CIG GAC 

Ala Hie Leu Gly Asp lie Ala Leu Asp 

3469. 3484 3499 3514 

GAA GAG GAC CIG AGG GCC TTC CAG GTA CAG CAG GOT GTG GAT CTC AGA OGG CAC 
75 Glu Glu Asp Leu Arg Ala Hie Gin Val Gin Gin Ala Val Asp Leu Arg Arg His 

3529 3544 (37) 3560 3570 

ACA GOT OGT AAG TOC TCC ATC AAA GCT GCA GGEAAGCOGG GIGCCAATGG 
Thr Ala Arg Lys Ser Ser He Lys Ala Ala 

20 

A probe specific for the human coding sequence given in Table V is used to identify a human cell line 
or tissue which synthesizes bone inductive factor. The probe is made according to the following method; 
Two oligonucleotides having the following sequences: 

25 (a) GGGAATT CT G CCTTT CTT GGGG ACATTGCCCT GG ACGAAGAGGACCTG AG 

(b) CGGGATCCGTCTGAGATCCACAGCCTGCTGTACCTGGAAGGCCCTCAGG 
are synthesized on an automated synthesizer, annealed, extended using the Klenow fragment of E. coli 
DNA polymerase I, digested with the restriction-enzymes Eco Rl and Bam HI, and inserted into an Ml 3 
vector. A single-stranded 32 P-labeled probe is then from template preparation of this subclone by standard 
30 techniques. Polyadenylated RNAs from various cell and tissue sources are electrophoresed on formal- 
dehyde-agarose gels and transfered to nitrocellulose by the method of Toole et al. t supra. The probe is then 
hybridized to the nitrocellulose blot in 50% formamide, 5 X SSC, 0.1% SDS, 40 mM sodium phosphate pH 
6.5, 100 ug/ml denatured salmon sperm DNA, and 5 mM vanadyl ribonucleosides at 42° C overnight and 
washed at 65° C in 0.2 X SSC, 0.1% SDS. Following autoradiography, the lane containing RNA from the 
35 human osteosarcoma cell line U-2 OS contains hybridizing bands corresponding to RNA species of 
approximately 4.3 and 3.0 kb. 

cDNA is synthesized from U-2 OS polyadenylated RNA and cloned into lambda gtIO by established 
techniques (Toole et al., supra ). 20,000 recombinants from this library are plated on each of 50 plates.. 
Duplicate nitrocellulose replicas are made of the plates. The above described oligonucleotides are kinased 
40 with 32 P-gamma-ATP and hybridized to the two sets of replicas at 55° centigrade in standard hybridization 
solution overnight. The filters are then washed in 1 X SSC, 0.1% SDS at 55° centigrade and subjected to 
autoradiography. One duplicate positive, designated lambda U20S-1, is plaque purified. Lambda U20S-1 
was deposited with the ATCC on June 16, 1987 under accession number 40343. 

The entire nucleotide sequence and derived amino acid sequence of the insert of lambda U20S-1 is 
45 given in Table VI. This cDNA clone encodes a Met followed by a hydrophobic leader sequence characteris- 
tic of a secreted protein, and contains a stop codon at nucleotide positons 2226 - 2228. This clone contains 
an open reading frame of 2190bp, encoding a protein of 730 amino acids with a molecular weight og 83kd 
based on this amino acid sequence. The clone contains sequence identical to the coding region given in 
Table V. This protein is contemplated to represent a primary translation product which is cleaved upon 
50 secretion to produce the hBMP-1 protein. This clone is therefore a cDNA for hBMP-1 corresponding to 
human gene fragment contained in the genomic hBMP-1 sequence lambda LP-H1. It is noted that amino 
acids #550 to #590 of BMP-1 are homologous to epidermal growth factor and the M growth factor" domains 
of Protein C, Factor X and Factor IX. 

55 
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TABLE VI 



10 



15 



20 



25 



30 



35 



40 



45 



50 



10 20 30 (1) 50 

CIAGAGGOOG CITCCCTOGC CGCCGCOCOG CCAGC AIG CCC GGC GIG GCC CGC CIG COG 

MET Pro Gly Veil Ala Arg Leu Pro 

65 80 95 110 

CIG CIG CTC GGG CIG CIG CIG CTC CCG OCT CCC GGC CGG COG CIG GAC TIG GCC 

Leu Leu Leu Gly leu Leu Leu Leu Pro Arg Pro Gly Arg Pro Leu Asp Leu Ala 

125 140 155 

GAC TAG ACC TAT GAC CIG GOG GAG GAG GAC GAC TOG GAG CCC CTC AAC TAG AAA 

Asp Tyr Thr Tyr Asp leu Ala Glu Glu Asp Asp Ser Glu Pro Leu Asn Tyr Lys 

170 185 200 215 

GAC CCC TGC AAG GCG GOT GCC TIT CIT GGG GAC AIT GCC CIG GAC GAA GAG GAC 

Asp Pro Cys lys Ala Ala Ala Hie leu Gly Asp lie Ala Leu Asp Glu Glu Asp 

230 245 260 275 

CIG AGG GCC TIC CAG CTA CAG CAG GCT GIG GAT CTC AGA OGG CAC ACA GCT OCT 

Ifiu Arg Ala Hie Gin Val Gin Gin Ala Val Asp Leu Arg Arg His Bur Ala Arg 

290 305 320 

AAG TOC TCC ATC AAA GCT GCA GIT CCA GGA AAC ACT TCT ACC CCC AGC TGC CAG 

lys Ser Ser lie Lys Ala Ala Veil Pro Gly Asn Thr Ser Thr Pro Ser Cys Gin 

335 350 365 380 

AGC ACC AAC GGG CAG CCT CAG AGG GGA GCC TCT GGG AGA TGG AGA GCT AGA ICC 

Ser Bir Asn Gly Gin Pro Gin Arg Gly Ala Cys Gly Arg Trp Arg Gly Arg Ser 

395 410 425 

OCT AGC OGG CGG GOG GOG AOG TCC OGA CCA GAG OCT GIG TGG CCC GAT GGG CTC 

Arg Ser Arg Arg Ala Ala Thr Ser Arg Pro Glu Arg Val Trp Pro Asp Gly Val 

440 . 455 470 485 

ATC CCC TIT CTC AIT GGG GGA AAC TIC ACT GCT AGC CAG AGG GCA CTC TIC OGG 

lie Pro Hie Val lie Gly Gly Asn Hie Thr Gly Ser Gin Arg Ala Val Hie Arg 

500 515 530 545 

CAG GCC A1G AGG CAC TGG GAG AAG CAC ACC TCT CTC ACC TIC CTG GAG CGC ACT 

Gin Ala MET Arg His Trp Glu lys His Thr Cys Val Thr Hie Leu Glu Arg Thr 

560 575 590 

GAC GAG GAC AGC TAT ATT CTG TTC ACC TAT OGA CCT TGC GGG TGC TGC TCC TAC 

Asp Glu Asp Ser Tyr lie Val Hie Thr Tyr Arg Pro Cys Gly Cys Cys Ser Tyr 

605 620 635 650 

CTG GCT CGC CGC GGC GGG GGC CCC CAG GCC ATC TCC ATC GGC AAG AAC TCT GAC 

Val Gly Arg Arg Gly Gly Gly Pro Gin Ala He Ser He Gly Lys Asn Cys Asp 



55 
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665 630 695 

AAG TTC GGC ATT GIG GTC CAC GAG CDS GGC CAC GTC GTC C-C-C TIC TCG CAC GAA 

Lys Fhe Gly He Val Val His Glu leu Gly His Val Val Gly Phe Trp His Glu 

710 725 740 755 

CAC ACT OGG CCA GAC OGG GAC CGC CAC GTT TCC ATC GIT OGT GAG AAC ATC CAG 

His Thr Arg Pro Asp Arg Asp Arg His Val Ser lie Val Arg Glu Asn He Gin 

770 785 800 815 

CCA GGG CAG GAG TAT AAC TTC CIG AAG ATG GAG CCT CAG GAG GTG GAG TCC CTG 

Pro Gly Gin Glu Tyr Asn Hie Leu Lys MET Glu Pro Gin Glu Val Glu Ser Leu 



830 845 860 

GGG GAG ACC TAT GAC TTC GAC AGC ATC ATS CAT TAC GCT OGG AAC ACA TTC TCC 

Gly Glu Thr Tyr Asp Hie Asp Ser lie MET His Tyr Ala Arg Asn Thr Fhe Ser 

875 890 905 920 

AGG GGC ATC TTC CIG GAT ACC AIT GTC CCC AAG TAT GAG GTG AAC GGG GTG AAA 

Arg Gly He Fhe Leu Asp Thr lie Val Pro Lys Tyr Glu Val Asn Gly Val Lys 

935 950 965 

OCT CCC AIT GGC CAA AGG ACA OGG CTC AGC AAG GGG GAC AIT GCC CAA GCC CGC 
Pro Pro He Gly Gin Arg Thr Arg Leu Ser Lys Gly Asp lie Ala Gin Ala Arg 

98 ° 995 1010 1025 

AAG CIT TAC AAG TGC OCA GCC TGT GGA GAG ACC CIG CAA GAC AGC ACA GGC AAC 

lys leu Tyr Lys Cys Pro Ala Cys Gly Glu Thr Leu Gin Asp Ser Thr Gly Asn 

7-040 1055 1070 1085 

TTC TOC TCC CCT GAA TAC CCC AAT GGC TAC TCT GCT CAC ATG CAC TGC GTG TCG 
Fhe Ser Ser Pro Glu Tyr Pro Asn Gly iyr Ser Ala His MET His Cys Val Trp 

1100 1115 1130 

OGC ATC TCT GTC ACA CCC GGG GAG AAG ATC ATC CTC AAC TTC AOS TOC CTC GAC 
Arg lie Ser Val Thr Pro Gly Glu Lys He lie Leu Asn Fhe Thr Ser Leu Asp 

1145 1160 1175 1190 

CTC TAC CGC AGC OGC CIG TGC TCG TAC GAC TAT GTC GAG GTC CGA GAT GGC TTC 
Leu Tyr Arg Ser Arg Leu Cys Trp Tyr Asp Tyr Val Glu Val Arg Asp Gly Fhe 

1205 1220 1235 

.TCG AGG AAG GOG CCC CTC OGA GGC OGC TTC TCC GGG TCC AAA CTC CCT GAG CCT 
Trp Arg Lys Ala Pro Leu Arg Gly Arg Hie Cys Gly Ser Lys Leu Pro Glu Pro 

1250 1265 1280 1295 

ATC GTC TCC ACT GAC AGC OGC CTC TCG GTT GAA TTC CGC AGC AGC AGC AAT TCG 

lie Val Ser Thr Asp Ser Arg leu Trp Val Glu Phe Arg Ser Ser Ser Asn Trp 

1310 1325 1340 1355 

GTT GGA AAG GGC TIC TIT GCA GTC TAC GAA GCC ATC TCC GGG GGT GAT GIG AAA 
Val Gly Lys Gly Fhe Fhe Ala Val Tyr Glu Ala lie Cys Gly Gly Asp Val Lys 
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1370 1385 1400 

MG GAC TAT GGC CAC ATI CAA. TCG CCC AAC TAC CCA GAC GXI TAC CGG CCC W3C 

Lys Asp Tyr Gly His lie Gin Ser Pro Asn Tyr Pro Aso Aso Tyr Arg Pro Ser 

s * 

1415 1430 1445 1460 

AAA GTC TCC ATC TGG CGG ATC CAG GIG TCT GAG GGC TIC CAC GIG GGC CTC ACA 

Lys Val Cys He Trp Arg He Gin Val Ser Glu Gly Phe His Val Gly leu Thr 

w 1475 1490 1505 

TTC CAG TCC ITT GAG AIT GAG CGC CAC GAC AGC IGT GCC TAC GAC TAT CIG GAG 

Phe Gin Ser Phe Glu lie Glu Arg His Asp Ser Cys Ala Tyr Asp Tyr leu Glu 

1520 1535 1550 1565 

is GIG CGC GAC GGG CAC ACT GAG AGC AGC ACC CTC ATC GGG CGC TAC TCT GGC TAT 

Val Arg Asp Gly His Ser Glu Ser Ser Ihr leu He Gly Arg iyr Cys Gly iyr 

1580 1595 1610 1625 

GAG AAG CCT GAT GAC. ATC AAG AGC ACG TCC AGC CGC CTC TGG CTC AAG TTC CTC 
20 Glu lys Pro Asp Asp lie lys Ser Thr Ser Ser Arg Leu Trp leu Lys Phe Val 

1640 1655 1670 

TCT GAC GGG TOC ATT AAC AAA GOG GGC TIT GOC GTC AAC ITT ITC AAA GAG GTG 

Ser Asp Gly Ser He Asn Lys Ala Gly Phe Ala Val Asn Phe Phe Lys Glu Val 

2j 1685 1700 1715 1730 

GAC GAG TCC TCT CGG CCC AAC OGC GGG GGC TCT GAG CAG CGG TGC CIC AAC ACC 

Asp Glu Cys Ser Arg Pro Asn Arg Gly Gly Cys Glu Gin Arg Cys leu Asn Thr 

1745 1760 1775 

CIG GGC AGC TAC AAG TGC AGC TCT GAC CCC GGG TAC GAG CIG GOC CCA GAC AAG 

leu Gly Ser iyr Lys Cys Ser Cys Asp Pro Gly iyr Glu Leu Ala Pro Asp Lys 

1790 1805 1820 1835 

OGC CGC TCT GAG GCT GCT TCT GGC GGA TTC CTC ACC AAG CTC AAC GGC TCC ATC 
j5 Arg Arg Cys Glu Ala Ala Cys Gly Gly Phe leu Thr Lys T pii Asn Gly Ser lie 

1850 1865 1880 1895 

AOC AGC COG GGC TGG CCC AAG GAG TAC CCC CCC AAC AAG AAC TGC ATC TGG CAG 

Ser Pro Gly Trp Pro lys Glu iyr Pro Pro Asn lys Asn Cys He Trp Gin 

1910 1925 1940 

CIG GTG GCC CCC ACC CAG TAC OGC ATC TCC CIG CAG TIT GAC TTC TIT GAG ACA 

Leu Val Ala Pro Thr Gin Tyr Arg lie Ser Leu Gin Phe Asp Fhe Phe Glu Thr 

45 1955 1970 1985 2000 

GAG GGC AAT GAT GTG TGC AAG TAC GAC TIC GIG GAG GIG CGC ACT GGA CTC ACA 

Glu Gly Asn Asp Val Cys Lys iyr Asp Phe Val Glu Val Arg Ser Gly Leu Thr 

2015 2030 2045 

50 GCT GAC TCC AAG CIG CAT GGC AAG TTC TCT GGT TCT GAG AAG CCC GAG CTC ATC 

Ala Asp Ser Lys leu His Gly Lys Phe Cys Gly Ser Glu Lys Fro Glu Val lie 
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2060 2075 2090 2105 

ACC TCC CAG TAC AAC AAC ATC CGC GTG GAG TTC AAG TOC GAC AAC ACC GIG TCC 

Thr Ser Gin Tyr Asn Asn MET Arg Val Glu Phe Lys Ser Asp Asn Thr val Ser 

23- 20 2135 2150 2165 

AAA AAG GGC TTC AAG GCC CAC TTC TTC TCA GAA AAG AGG CCA GOT CIG CAG COC 

Ijys Lys Gly Fhe Lys Ala His Phe Phe Ser Glu Lys Arg Pro Ala leu Gin Pro 

2180 2195 2210 

CCT OGG GGA CGC CCC CAC . CAG CTC AAA TIC OGA GTC CAG AAA AGA AAC CGG ACC 

Pro Arg Gly Arg Pro His Gin Leu Lys Phe Arg Val Gin Lys Arg Asn Arg Uir 

(730) 

2225 2235 2245 2255 2265 2275 2285 

CCC CAG TGAGGCCIGC CAGGCCTCCC GGACCCCITG TTACICAGGA ACCTCACCIT GGACGGAATC 
Pro Gin 



2295 2305 2315 2325 2335 2345 2355 

GGATGGGGGC 1T03GTGCCC ACCAACCCCC CACCTCCACT CTCCCATTCC GGCCCACCTC CCICIGGCCG 



20 



2365 2375 2385 2395 2405 2415 2425 

GACAGAACTG GTGCTCICIT CTCCCCACTG TGCCOCTCOG CGGACCGGGG ACCCITCCCC GTGCCCIACC 



25 



2435 2445 2455 2465 2475 2485 2495 

CCCTCCCATT TTSATGGTCT CTGTGAGAXT TCCIGTIGTG AAG32AAAAGA GGGACCCCTG CCTOCTGCCT 



30 



CXAGA 



B. hBMP-2: Class 1 and H 

35 The Hindlll-Sacl bovine genomic bBMP-2 fragment described in Example IV B. is subcloned into an 
Ml 3 vector. A ^P-labeled single-stranded DNA probe is made from a template preparation of this subclone. 
This probe is used to screen polyadenylated RNAs from various cell and tissue sources as described above 
in part A. A hybridizing band corresponding to an mRNA species of approximately 3.8 kb is detected in the 
lane containing RNA from the human cell line U-2 OS. The Hindlll-Sacl fragment is labeled with 32 P by nick 
40 translation and used to screen the nitrocellulose filter replicas of the above-described U-2 OS cDNA library 
by hybridization in standard hybridization buffer at 65° overnight followed by washing in 1 X SSC, 0.1% 
SDS at 65°. Twelve duplicate positive clones are picked and replated for secondaries. Duplicate nitrocel- 
lulose replicas are made of the secondary plates and both sets hybridized to the bovine genomic probe as 
the primary screening was performed. One set of filters is then washed in 1 X SSC, 0.1% SDS; the other in 
45 0.1 X SSC, 0.1 % SDS at 65°. 

Two classes of hBMP-2 cDNA clones are evident based on strong (4 recombinants) or weak (7 
recombinants) hybridization signals under the more stringent washing conditions (0.1 X SSC, 0.1% SDS). 
All 11 recombinant bacteriophage are plaque purified, small scale DNA preparations made from plate 
lysates of each, and the inserts subcloned into pSP65 and into Ml 3 for sequence analysis. Sequence 
so analysis of the strongly hybridizing clones designated hBMP-2 Class I (also known as BMP-2) indicates that 
they have extensive sequence homology with the sequence given in Table III. These clones are therefore 
cDNA encoding the human equivalent of the protein encoded by the bBMP-2 gene whose partial sequence 
is given in Table III. Sequence analysis of the weakly hybridizing recombinants designated hBMP-2 Class II 
(also known as BMP-4) indicates that they are also quite homologous with the sequence given in Table III at 
55 the 3* end of their coding regions, but less so in the more 5' regions. Thus they encode a human protein of 
similar, though not identical, structure to that above. 

Full length hBMP-2 Class I cDNA clones are obtained in the following manner. The 1 .5 kb insert of one 
of the Class II subclones (11-10-1) is isolated and radioactively labeled by nick-translation. One set of the 
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nitrocellulose replicas of the U-2 OS cDNA library screened above (50 filters, corresponding to 1 ,000,000 
recombinant bacteriophage) are rehybridized with this probe under stringent conditions (hybridization at 65° 
in standard hybridization buffer; washing at 65° in 0.2 X SSC, 0.1% SDS). All recombinants which hybridize 
to the bovine genomic probe which do not hybridize to the Class II probe are picked and plaque purified (10 
recombinants). Plate stocks are made and small scaie bacteriophage DNA preparations made. After 
subcloning into Ml 3, sequence analysis indicates that 4 of these represent clones which overlap the original 
Class I clone. One of these, lambda U20S-39, contains an approximately 1.5 kb insert and was deposited 
with the ATGC on June 16, 1987 under accession number 40345. The partial DNA sequence (compiled 
from lambda U20S-39 and several, other hBMP-2 Class I cDNA recombinants) and derived amino acid 
sequence are shown below in Table VII. Lambda U20S-39 is expected to contain all of the nucleotide 
sequence necessary to encode the entire human counterpart of the protein BMP-2 Class I encoded by the 
bovine gene segment whose partial sequence is presented in Table III. This human cDNA hBMP-2 Class I 
contains an open reading frame of 1188 bp, encoding a protein of 396 amino acids. This protein of 396 
amino acids has a molecular weight of 45kd based on this amino acid sequence. It is contemplated that this 
sequence represents the primary translation product. The protein is preceded by a 5* untranslated region of 
342 bp with stop codons in all frames. The 13 bp region preceding this 5’ untranslated region represents a 
linker used in the cDNA cloning procedure. 
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10 20 30 40 50 60 70 

GTOGACTCIA GAGIGIOIGT CAGCACTIGG CIGC-GGACIT CTTGAACITG CAGGGAGAAT AACTTGOOCA 

80 90 100 110 120 130 140 

CCCCACTTTG OGCOGGIGCC TTIGCCOCAG OGGAGCCIGC TICGCCATCr CCGAGCCCCA COGCCCCICC 

15° 160 170 180 190 200 210 

ACTCCIOGGC CITGCCOGAC ACIGAGACGC TGTICCCAGC GTGAAAAGAG AGACIGCGOG GCCEGCACCC 



75 



220 230 240 250 260 270 280 

GGGAGAAGGA GGAGGCAAAG AAAAGGAACG GACATTCGGr CCITGCGCCA GGTCCITIGA CCAGAGTT Ti 1 



20 



290 300 310 320 330 340 350 

TCCATGTGGA OGCICITTCA ATGGAOGIGT CCCCGCGTGC TTCITAGACG GACIGOGGTC TCCIAAAGGT 



25 



30 



35 



40 



45 



(1) 370 385 400 

CGACC AIG GIG GCC GGG ACC OGC TCT CIT CIA GOG TTG CIG CTT CCC CAG GTC 

MET Val Ala Gly Thr Arg Cys Leu leu Ala Leu Leu Leu Fro Gin Val 

415 430 445 

CTC CIG GGC GGC GCG GOT GGC CIO GIT COG GAG CIG GGC OGC AGG AAG TTC GOG 

Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe Ala 

460 475 490 505 

GOG GOG TOG TOG GGC OGC CCC TCA TOC CAG COC TCT GAC GAG GTC CIG AGO GAG V 

Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu Val Leu Ser Glu 

_ 520 535 550 565 

TTC GAG TTG OGG CIG CIO AGC ATG TTC GGC CIG AAA CAG AGA CCC ACC CCC AGO 

.Phe Glu leu Arg Leu Leu Ser MET Fhe Gly Leu Lys Gin Arg Pro Thr Pro Ser 

580 595 ' 610 
AGG GAC GCC GIG GIG COC CCC TAC AIG CIA GAC CIG TAT OGC AGG CAC TOG GGT 

Arg Asp Ala Val Val Pro Pro Tyr MET Leu Asp Leu Tyr Arg Arg His Ser Gly 

625 640 655 670 

CAG COG GGC TCA CCC GCC CCA GAC CAC OGG TIG GAG AGG GCA GCC AGC OGA GCC 

Gin Pro Gly Ser Pro Ala Pro Asp His Arg Leu Glu Arg Ala Ala Ser Arg Ala 

685 700 715 

AAC ACT GTG OGC AGC TIC CAC CAT GAA GAA TCT TIG GAA GAA CIA CCA GAA ACG 

Asn Thr Val Arg Ser Fhe His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr 



50 



55 
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730 745 760 775 

ACT GGG AAA ACA ACC CGG AGA TTC TIC TIT AAT TTA ACT TCT ATC CCC ACC. GAG 

Ser Gly Lys Thr Thr Arg Arg Phe Hie Phe Asn Leu Ser Ser lie Pro Thr Glu 

5 

790 805 820 835 

GAG TIT ATC ACC TCA GCA GAG CIT CAG GOT TIC OGA GAA CAG ATG CAA GAT GCT 

Glu Phe lie Thr Ser Ala Glu leu Gin Val Phe Arg Glu Gin MET Gin Asp Ala 

io 850 865 880 

TIA GGA AAC AAT AGO ACT TIC CAT CAC OGA AIT AAT AIT TAT GAA ATC ATA AAA 

Leu Gly Asn Asn Ser Ser Hie His His Arg lie Asn lie iyr Glu He lie Lys 

895 910 925 940 

, 5 CCT GCA ACA GOC AAC TOG AAA TIC CCC GIG ACC ACT CIT TIG GAC ACC AGG TIG 
Pro Ala Dir Ala Asn Ser Lys Phe Fro Val Dir Ser Leu Leu Asp Thr Arg Leu 

955 970 985 

GIG AAT CAG AAT GCA AGC AGG TGG GAA ACT TIT GAT GIC ACC CCC GOT . GTG ATG 
20 Val Asn Gin Asn Ala Ser Arg Trp Glu Ser Phe Asp Val Thr Pro Ala Val MET 

1000 1015 1030 1045 

OGG TGG ACT GCA CAG GGA CAC GOC AAC CAT GGA TIC GIG GIG GAA GIG GOC CAC 

Arg Trp Thr Ala Gin Gly His Ala Asn His Gly Phe Val Val Glu Val Ala His 

25 1060 1075 1090 U05 

TIG GAG GAG AAA CAA GCT GIC TOC AAG AGA CAT GIT AGG ATA AGC AGG TCT TIG 

Leu Glu Glu Lys Gin Gly Val Ser Lys Arg His Val Arg lie Ser Arg Ser Leu 

1120 1135 1150 

30 CAC CAA GAT GAA CAC AGC TGG TCA CAG ATA AGG CCA TIG CIA GEA ACT TIT GGC 
His Gin Asp Glu His Ser Trp Ser Gin lie Arg Pro Leu Leu Val Thr. Phe Gly 



35 



40 



1165 H80 1195 1210 

CAT GAT GGA AAA GGG CAT CCT CTC CAC AAA AGA GAA AAA OCT CAA GCC AAA CAC 

His Asp Gly Lys Gly His Pro Leu His Lys Arg Glu Lys Arg Gin Ala Lys His 

1225 1240 1255 

AAA CAG CGG AAA CGC CIT AAG TCC AGC TGT AAG AGA CAC CCT TIG TAC GIG GAC 

Lys Gin Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp 

1270 1285 " " 1300 1315 

•TTC ACT GAC GIG GGG TGG AAT GAC TGG AIT GIG GOT CCC COG GGG TAT CAC GCC 

Fhe Ser Asp Val Gly Trp Asn Asp Trp lie Val Ala Pro. Pro Gly Tyr His Ala 



45 1330 1345 1360 1375 

TIT TAC IGC CAC GGA GAA TGC CCT TOT CCT CIG GCT GAT CAT CIG AAC TCC ACT 
Hie Tyr Cys His Gly Glu Cys Pro Fhe Pro Leu Ala Asp His Leu Asn Ser Thr 



so 



1390 1405 1420 

AAT CAT GOC AIT GOT CAG AGG TIG CTC AAC TCT GOT AAC TCT AAG AIT OCT AAG 
Asn His Ala lie Veil Gin Dir Leu Val Asn Ser Val Asn Ser Lys lie Fro Lys 



55 



21 




EP 0 688 869 A1 



1435 1450 1465 1480 

GCA TGC TCT GTC COG ACA GAA CIO ACT C-CT ATC TOG ATG CTG TAC CTT GAC GAG 

Ala Cys Cys Val Pro Thr Glu IfiU Ser Ala He Ser H2T Leu Tyr Leu Asp Glu 

5 1495 1510 1525 

AAT GAA AAG CTT GIA TIA AAG AAC TAT CAG GAC ATG CTT CTG GAG GCT TCT GGG 

Asn Glu Lys Val Val leu Lys Asn Tyr Gin Asp MET Val Val Glu Gly Cys Gly 

. 1540(396) 1553 1563 1573 1583 1593 1603 

io TCT CGC TACTACAGCA AAATEAAATA CATAAAIAIA TAIAIATATA TAIA TITI AG AAAAAAGAAA 

Cys Arg 



AAAA 

75 

Full-length hBMP-2Clas$ II human cDNA clones are obtained in the following manner. The 200 bp 
EcoRI-SacI fragment from the 5* end of the Class II recombinant 11-10-1 is isolated from its plasmid 
subclone, labeled by nick-translation, and hybridized to a set of duplicate nitrocellulose replicas of the U-2 
20 OS cDNA library (25 filters/set; representing 500,000 recombinants). Hybridization and washing are 
performed under stringent conditions as described above. 16 duplicate positives are picked and replated for 
secondaries. Nitrocellulose filter replicas of the secondary plates are made and hybridized to an 
oligonucleotide which was synthesized to correspond to the sequence of 11-10-1 and is of the following 
sequence: 

25 CGG6CGCT CAGG ATACT CAAG ACCAGTGCTG 

Hybridization is in standard hybridization buffer AT 50° C with washing at 50° in 1 X SSC, 0.1% SDS. 14 
recombinant bacteriophage which hybridize to this oligonucleotide are plaque purified. Plate stocks are 
made and small scale bacteriophage DNA preparations made. After sucloning 3 of these into Ml 3, 
sequence analysis indicates that they represent clones which overlap the original Class II clone. One of 
30 these, lambda U20S-3, was deposited with the ATCC under accession number 40342 on June 16, 1987. 
U20S-3 contains an insert of approximately 1.8 kb. The partial DNA sequence and derived amino acid 
sequence of U20S-3 are shown below in Table VIII. This clone is expected to contain all of the nucleotide 
sequence necessary to encode the entire human BMP-2 Class II protein. This cDNA contains an open 
reading frame of 1224 bp, encoding a protein of 408 amino acids, preceded by a 5' untranslated region of 
35 394 bp with stop codons in all frames, and contains a 3' untranslated region of 308 bp following the in- 

frame stop codon. The 8 bp region preceding the 5' untranslated region represents a linker used in the 
cDNA cloning procedure. This protein of 408 amino acids has molecular weight of 47kd and is con- 
templated to represent the primary translation. product. 

40 



45 



so 
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TABLE VIII 

10 20 30 40 50 60 70 

CTCIAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGOGC GGAGCCOGGC CCGGAAGCIA GGTGAGTCTG 



80 90 100 110 120 130 140 

GCATCOGAGC TGAGGGAOGC GAGCCTGAGA CGCOGCIGCT GCICCGGCTG AGTATCIAGC TICTCTCCCC 



150 160 170 180 190 200 210 

GATGGGATTC CCGTCCAAGC TATCTOGAGC CIGCAGOGOC ACAGICCCCG GCOCTOGCCC AGCTTCACIG 



220 230 240 250 260 270 280 

CAAOOCTTCA GAGCTCCCCA GGAGCIGCIG CTCGCGAGCC CGCIACTGCA GGGAOCEATC GAGCCA3TCC 



290 300 310 320 330 340 350 

CTAGTGCCAT COCGAGCAAC GCACIGCIGC AGCITCCCIG AGCCITTCCA GCAACTITCT TCAAGAITGG 



360 370 380 390 400 (1) 

CTGTCAAGAA TCATCSACTG TTATTAIATG CCITCITITC TGTCAAGACA CC ATC ATT OCT 

MET lie Pro 

417 432 447 462 

GCT AAC OGA ATC CTO ATC GTC GIT TTA TEA TCC CAA CTC CTG CIA GGA GGC GOG 

Gly Asn Arg MET Leu MET Val Val Leu Leu Cys Gin Val Leu Leu Gly Gly Ala 

477 492 507 

AGO CAT GCT ACT TEG AIA CCT GAG AOG GGG AAG AAA AAA GTC GCC GAG AIT CAG 

Ser His Ala Ser Leu lie Pro Glu Ihr Gly Lys Lys Lys Val Ala Glu lie Gin 

522 537 552 567 

GGC CAC GOG GGA GGA OGC OGC TCA GGG CAG AGO CAT GAG CTC CIG OGG GAC TIC 

Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu Leu Leu Arg Asp Rie 

582 597 612 627 

GAG GOG ACA CIT CIG CAG ATC TIT GGG CIG OGC OGC OGC COG CAG CCT AGC AAG 

-Glu Ala Tir Leu Leu Gin MET Rie Gly Leu Arg Arg Arg Pro Gin Pro Ser Lys 

642 657 672 

ACT GCC GTC AIT COS GAC TAC ATC OGG GAT CIT TAC OGG CIT CAG TCT GGG GAG 

Ser Ala Val lie Pro Asp Tyr MET Arg Asp Leu iyr Arg Leu Gin Ser Gly Glu 

687 702 717 732 

GAG GAG GAA GAG CAG ATC CAC AGC ACT GCT CTT GAG TAT CCT GAG OGC COG GCC 

Glu Glu Glu Glu Gin lie His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala 
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25 



30 



35 



40 



45 



50 



747 762 777 

AGC OGG GCC AAC ACC GIG AGG AC-C TIC CAC CAC GAA GAA CAT CIC GAG AAC AIC 
Ser Arg Ala Asn Thr Val Arg Ser Fhe His His Glu Glu His Leu Glu Asn He 

792 807 822 837 

CCA GGG ACC AGT GAA AAC TCP GCT TIT OGT TTC CTC TIT AAC CIC AGC AGC AIC 

Pro Gly Thr Ser Glu Asn Ser Ala Fhe Arg Fhe Leu Fhe Asn Leu Ser Ser lie 

852 867 882 897 

CCT GAG AAC GAG GIG . AIC TCC TCT GCA GAG CIT OGG CIC TIC OGG GAG CAG GIG 
Pro Glu Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu Fhe Arg Glu Gin Val 

912 927 942 

GAC CAG GGC CCT GAT IGG GAA AGG GGC TTC CAC OGT AIA AAC AIT TAT GAG GIT 
Asp Gin Gly Pro Asp Trp Glu Arg Gly Fhe His Arg lie Asn lie Tyr Glu Val 

957 972 987 1002 

CCA GCA GAA CTS 0115 OCT GGG CAC CTC ATC ACA OGA CIA CTG GAC 

MET Lys Pro Pro Ala Glu Val Val Pro Gly His Leu lie Thr Arg Ton Asp 

T017 1032 1047 

W3G.AGA CTG GTC CAC CAC AAT GIG ACA OGG IGG GAA ACT ITT GAT GIG AGC CCT 
Thr Arg Leu Val His His Asn Val Thr Arg Tip Glu Thr Fhe Asp Val Ser Pro 

1062 T077 1092 H07 

GOG GTC CIT OGC IGG AOC OGG GAG AAG CAG CCA AAC TAT GGG CIA GCC AIT GAG 

Ala Val leu Arg lip Thr Arg Glu Lys Gin Pro Asn Tyr Gly Leu Ala lie Glu 

H22 1137 1152 1167 

H 1 ? S 10 C*? C?lG ACT OGG AOC CAC CAG GGC CAG CAT GTC AGG AIT 'AGC 

al Ihr His Leu His Gin Thr Arg Thr His Gin Gly Gin His Val Arg lie Ser 

1182 1197 1212 

OGA TOG TIA CCT CAA GGG ACT GGG AAT IGG GCC CAG CIC OGG CCC CTC CTG GTC 
Arg Ser leu Pro Gin Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu Val 

1227 1242 1257 1272 

Ihr Fhe Gly His Asp Gly Arg Gly His Ala leu Ihr Arg Arg Arg Arg Ala Lys 

1287 1302 1317 

^AGCCCTAAGCATCACTCACAGCGGGCCAGGAAGAAGAATAAGAACTGCaGG 

Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys Arg 

iP 2 1347 1362 1377 

OGC CAC TOG CTC TAT GIG GAC TIC AGC GAT GIG GGC IGG AAT GAC TGG AIT GIG 

Arg His Ser Leu Tyr Val Asp Fhe Ser Asp Val Gly Trp Asn Asp Tip lie Val 

1392 1407 1422 1437 

GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC TGC CCC TIT CCA CIG 
Ala Pro Pro Gly Tyr Gin Ala Fhe Tyr Cys His Gly Asp Cys Pro Fhe Pro Leu 
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1452 1467 1482 

GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC AIT GIG CAG ACC CIG GIC AAT TCT 
Ala Asp His Leu Asn Ser Thr Asn His Ala lie Val Gin Thr Ieu Val Asn Ser 



1497 1512 1527 1542 

GTC AAT TOC ACT ATC COC AAA GCC IGT TGT GIG CCC ACT GAA CIG AGT GCC ATC 
Val Asn Ser Ser lie Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala lie 



w 



15 



1557 1572 1587 

TOC A3G CIG TAG CIG GAT GAG TAT GAT AAG GIG GTA CIG AAA AAT TAT CAG GAG 
Ser MET leu Tyr leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn iyr Gin Glu 

1602 1617 (408) 1636 1646 1656 

AIG GTA GTA GAG GGA TGT GGG TGC CGC TGAGATCAGG CAGICCITGA GGATAGACAG 
MET Val Val Glu Gly Cys Gly Cys Arg 



20 



1666 1676 1686 1696 1706 1716 1726 

ATATACACAC CACACACACA CACCACATAC AOCACACACA CACGTTCCCA TCCACICACC CACACACIAC 

1736 1746 1756 1766 1776 1786 1796 

ACAGACIGCT TCX'I'JJATAGC TGGACITTEA 1T1AAAAAAA AAAAAAAAAA AATGGAAAAA ATOOCIAAAC 



25 



1806 1816 1826 1836 1846 1856 1866 

ATTCACCTIG ACCITAITIA IGACTITACG TGCAAAIGTT TIGACCAIAT TGATCATATA I'l'l'l GACAAA 



187 6 1886 1896 1906 1916 1926 1936 

atatatttat aactacgiat taaaagaaaa aaaiaaaaig agtcaitatt ttaaaaaaaa aaaaaaaact 

1946 

CIAGAGICGA OGGAAITC 

35 

The sequences of BMP-2 Class I and II, as well as BMP-3 as shown in Tables III, IV, VII and VIII have 
significant homology to the beta (B) and beta (A) subunits of the inhibins. The inhibins are a family of 
hormones which are presently being investigated for use in contraception. See, A. J. Mason et al, Nature, 
318:659-663 (1985). To a lesser extent they are also homologous to Mullerian inhibiting substance (MIS), a 
40 testicular glycoprotein that causes regression of the Mullerian duct during development of the male embryo 
and transforming growth factor-beta (TGF-b) which can inhibit or stimulate growth of cells or cause them to 
differentiate. Furthermore, the sequence of Table VII encoding hBMP-2 Class II has significant homology to 
the Drosophila decapentaplegic (DPP-C) locus transcript. See, J. Massague, Cell, 49:437-438 (1 987); R. W. 
Padgett et al, Nature , 325:81-84 (1987); R. L. Cate et al, Cell 45: 685-698 (1986). It is considered possible 
45 therefore that BMP-2 Class II is the human homolog of the protein made from this transcript from this 
developmental mutant locus. 



C. BMP-3 



50 Because bovine and human bone growth factor genes are presumed to be significantly homologous, 
oligonucleotide probes which have been shown to hybridize to the bovine DNA sequence of Table IV.A and 
IV. B are used to screen a human genomic library. A human genomic library (Toole et al., supra ) is screened 
using these probes, and presumptive positives are isolated and DNA sequence obtained as described 
above. Evidence that this recombinant encodes a portion of the human bone inductive factor molecule relies 
55 on the bovine/human protein and gene structure homologies. 

Once a recombinant bacteriophage containing DNA encoding a portion of the human BMP-3 molecule 
is obtained the human coding sequence is used as a probe as described in Example V (A) to identify a 
human cell line or tissue which synthesizes BMP-3. mRNA is selected by oligo (dT) cellulose chromatog- 
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raphy and cDNA is synthesized and cloned in lambda gtIO by established techniques (Toole et al., supra). 

Alternatively, the entire gene encoding this human bone inductive factor can be identified and obtained 
in additional recombinant clones if necessary. Additional recombinants containing further 3' or 5* regions of 
this human bone inductive factor gene can be obtained by identifying unique DNA sequences at the end(s) 
5 of the original clone and using these as probes to rescreen the human genomic library. The gene can then 
be reassembled in a single plasmid by standard molecular biology techniques and amplified in bacteria. 
The entire human BMP-3 factor gene can then be transferred to an appropriate expression vector. The 
expression vector containing the gene is then transfected into a mammalian cell, e.g. monkey COS cells, 
where the human gene is transcribed and the RNA correctly spliced. Media from the transfected cells are 
to assayed for bone inductive factor activity as described herein as an indication that the gene is complete. 
mRNA is obtained from these cells and cDNA synthesized from this mRNA source and cloned. The 
procedures described above may similarly be employed to isolate other species' bone inductive factor of 
interest by utilizing the bovine bone inductive factor and/or human bone inductive factor as a probe source. 
Such other species' bone inductive factor may find similar utility in, inter alia, fracture repair. 

75 

EXAMPLE VI 

Expression of Bone Inductive Factors. 

20 In order to produce bovine, human or other mammalian bone inductive factors, the DNA encoding it is 
transferred into an appropriate expression vector and introduced into mammalian cells by conventional 
genetic engineering techniques. 

One skilled in the art can construct mammalian expression vectors by employing the sequence of 
Tables II-VIII or other modified sequences and known vectors, such as pCD [Okayama et al., Mol. Cell Biol., 
25 2:161-170 (1982)] and pJL3, pJL4 [Gough et al., EMBO J., 4:645-653 (1985)]. The transformation of these 

vectors into appropriate host cells can result in expression of osteoinductive factors. One skilled in the art 
could manipulate the sequences of Tables II-VIII by eliminating or replacing the mammalian regulatory 
sequences flanking the coding sequence with bacterial sequences to create bacterial vectors for intracellular 
or extracellular expression by bacterial cells. For example, the coding sequences could be further 
30 manipulated (e.g. ligated to other known linkers or modified by deleting non-coding sequences there-from 
or altering nucleotides therein by other known techniques). The modified bone inductive factor coding 
sequence could then be inserted into a known bacterial vector using procedures such as described in T. 
Taniguchi et al., Proc. Natl Acad. Sci. USA , 77:5230-5233 (1980). This exemplary bacterial vector could 
then be transformed into bacterial host cells and bone inductive factor expressed thereby. For a strategy for 
35 producing extracellular expression of bone inductive factor in bacterial cells., see, e.g. European patent 
application EPA 177,343. 

Similar manipulations can be performed for the construction of an insect vector [See, e.g. procedures 
described in published European patent application 155,476] for expression in insect cells. A yeast vector 
could also be constructed employing yeast regulatory sequences for intracellular or extracellular expression 
40 of the factors of the present invention by yeast cells. [See, e.g., procedures described in published PCT 
application W086/00639 and European patent application EPA 123,289]. 

A method for producing high levels of an osteoinductive factor of the invention from mammalian cells 
involves the construction of cells containing multiple copies of the heterologous bone inductive factor gene. 
The heterologous gene can be linked to an amplifiable marker, e.g. the dihydrofolate reductase (DHFR) 
45 gene for which cells containing increased gene copies can be selected for propagation in increasing 
concentrations of methotrexate (MTX) according to the procedures of Kaufman and Sharp, J. Mol. Biol., 
159:601-629 (1982). This approach can be employed with a number of different cell types. 

For example, a plasmid containing a DNA sequence for a bone inductive factor of the invention in 
operative association with other plasmid sequences enabling expression thereof and the DHFR expression 
so plasmid pAdA26SV(A)3 [Kaufman and Sharp, Mol. Cell. Biol. , 2:1304 (1982)] can be co-introduced into 
DHFR-deficient CHO cells, DUKX-BII, by calcium phosphate coprecipitation and transfection. DHFR ex- 
pressing transformants are selected for growth in alpha media with dialyzed fetal calf serum, and 
subsequently selected for amplification by growth in increasing concentrations of MTX (sequential steps in 
0.02, 0.2, 1.0 and 5uM MTX) as described in Kaufman et al., Mol Cell Biol. , 5:1750 (1983). Transformants 
55 are cloned, and biologically active bone inductive factor expression is monitored by rat bone formation 
assay. Bone inductive factor expression should increase with increasing levels of MTX resistance. Similar 
procedures can be followed to produce other bone inductive factors. 
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Alternatively, the human gene is expressed directly, as described above. Active bone inductive factor 
may be produced in bacteria, or yeast cells. However the presently preferred expression system for 
biologically active recombinant human bone inductive factor is stably transformed CHO cells. 

As one specific example, to produce the human bone inductive factor (hBMP-1) of Example V, the 
5 insert of U20S-1 is released from the vector arms by digestion with Sal I and subcloned, into the 
mammalian expression vector pMT2CX digested with Xho I. Plasmid DNA from this subclone is transfected 
into COS cells by the DEAE-dextran procedure [Sompayrac and Danna PNAS 78:7575-7578 (1981): 
Luthman and Magnusson, Nucl.Acids Res . 11: 1295-1308 (1983)]. Serum-free 24 hr. conditioned medium is 
collected from the cells starting 40-70 hr. post-transfection. 
w The mammalian expression vector pMT2 Cla-Xho (pMT2 CX) is a derivative of p91023 (b) (Wong et al., 
Science 228:810-815, 1985) differing from the latter in that it contains the ampicillin resistance gene in 
place of the tetracycline resistance gene and further contains a Xhol site for insertion of cDNA clones. The 
functional elements of pMT2 Cla-Xho have been described (Kaufman, R.J., 1985, Proc. Natl: Acad. Sci. USA 
82:689-693) and include the adenovirus VA genes, the SV40 origin of replication including the 72 bp 
is enhancer, the adenovirus major late promoter including a 5' splice site and the majority of the adenovirus 
tripartite leader sequence present on adenovirus late mRNAs, a 3' splice acceptor site, a DHFR insert, the 
SV40 early polyadenylation site (SV40), and pBR322 sequences needed for propagation in E. coli. 

Plasmid pMT2 Cla-Xho is obtained by EcoRI digestion of pMT2-VWF, which has been” deposited with 
the American Type Culture Collection (ATCC), Rockville, MD (USA) under accession number ATCC 67122. 
20 EcoRI digestion excises the cDNA insert present in pMT2-VWF, yielding pMT2 in linear form which can be 
ligated and used to transform E. coji HB 101 or DH-5 to ampicillin resistance. Plasmid pMT2 DNA can be 
prepared by conventional methods. pMT2CX is then constructed by digesting pMT2 with Eco RV and Xbal, 
treating the digested DNA with Klenow fragment of DNA polymerase I, and ligating Cla linkers (NEBiolabs, 
CATCGATG). This removes bases 2266 to 2421 starting from the Hind III site near the SV40 origin of 
25 replication and enhancer sequences of pMT2. Plasmid DNA is then digested with EcoRI, blunted as above, 
and ligated to an EcoRI adapter, 

5’ PO 4 -AATTCCTCGAGAGCT 3' 

3’ GGAGCTCTCGA 5' 

digested with Xhol, and ligated, yielding pMT2 Cla-Xho, which may then be used to transform E. coji to 
30 ampicillin resistance. Plasmid pMT2 Cla-Xho DNA may be prepared by conventional methods. 

Example VII 

Biological Activity of Expressed Bone Inductive Factor 
35 

A. BMP-1 

To measure the biological activity of the expressed bone inductive factor, (hBMP-1) obtained in 
Example VI above. The factor is partially purified on a Heparin Sepharose column. 4 ml of transfection 
40 supernatant from one 100 mm dish is concentrated approximately 10 fold by ultrafiltration on a YM 10 
membrane and then dialyzed against 20mM Tris, 0.15 M NaCI, pH 7.4 (starting buffer). This material is then 
applied to a 1.1 ml Heparin Sepharose column in starting buffer. Unbound proteins are removed by an 8 ml 
wash of starting buffer, and bound proteins, including BMP-1, are desorbed by a 3-4 ml wash of 20 mM 
Tris, 2.0 M NaCI, pH 7.4, 

45 The proteins bound by the Heparin column are concentrated approximately 10-fold on a Centricon 10 
and the salt reduced by diafiltration with 0.1% trifluoroacetic acid. The appropriate amount of this solution is 
mixed with 20 mg of rat matrix and then assayed for jn vivo bone and cartilage formation as previously 
described in Example III. A mock transfection supernatant fractionation is used as a control. 

The implants containing rat matrix to which specific amounts of human BMP-1 have been added are 
so removed from rats after seven days and processed for histological evaluation. Representative sections from 
each implant are stained for the presence of new bone mineral with von Kossa and acid fuschin, and for the 
presence of cartilage-specific matrix formation using, toluidine blue. The types of cells present within the 
section, as well as the extent to which these cells display phenotype are evaluated. 

Addition of human BMP-1 to the matrix material resulted in formation of cartilage-like nodules at 7 days 
55 post implantation. The chondroblast-type cells were recognizable by shape and expression of metach- 
romatic matrix. The amount of activity observed for human BMP-1 was dependent upon the amount of 
human BMP-1 protein added to the matrix. Table IX illustrates the dose-response relationship of human 
BMP-1 protein to the amount of bone induction observed. 
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Table IX 



IMPLANT NUMBER 


AMOUNT USED (equivalent of ml transfection media) 


HISTOLOGICAL SCORE 


876-134-1 


10 BMP-1 


C + 2 


876-134-2 


3 BMP-1 


C + 1 


876-134-3 


1 BMP-1 


c + /- 


876-134-4 


10 MOCK 


c- 


876-134-5 


3 MOCK 


c - 


876-134-6 


1 MOCK 


c- 


Cartilage (c) activity was scored on a scale from O(-) to 5. 



Similar levels of activity are seen in the Heparin Sepharose fractionated COS cell extracts. Partial 
15 purification is accomplished in a similar manner as described above except that 6 M urea is included in all 
the buffers. Further, in a rat bone formation assay as described above, BMP-2 has similarly demonstrated 
chondrogenic activity. 

The procedures described above may be employed to isolate other bone inductive factors of interest by 
utilizing the bovine bone inductive factors and/or human bone inductive factors as a probe source. Such 
20 other bone inductive factors may find similar utility in, inter alia, fracture repair. 

The foregoing descriptions detail presently preferred embodiments of the present invention. Numerous 
modifications and variations in practice thereof are expected to occur to those skilled in the art upon 
consideration of these descriptions. Those modifications and variations are believed to be encompassed 
within the claims appended hereto. 
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Claims 



1. A gene encoding bovine BMP-3 comprising the following DNA sequence: 

5 



10 



15 



20 



25 



30 



35 



40 



45 



383 393 403 413 423 

GAGGAGGAAG OGGIXTAQGG GCCTOCTTCT GCCTCIGCAG AAC AAT GAG CIT COT GGG GCA 

Asn Asn Glu Leu Fro Glv Ala 





443 








458 




473 






438 


GAA TAT 


CAG 


TAC AAG 


GAG 


GAT 


GAA 


GTA TCG GAG GAG 


AGG 


AAG CCT 


TAC AAG ACT 


Glu Tyr 


Gin 


Tyr Lys. 


Glu 


Asp 


Glu 


Val Trp Glu Glu 


•Arg 


Lys Pro 


Tyr Lys Thr 






503 








518 




533 






CIT CAG 


ACT 


CAG CCC 


CCT 


GAT 


AAG 


AGT AAG AAC AAA 


AAG 


AAA CAG 


AGG 


AAG GGA 


Leu Gin 


Thr 


Gin Pro 


Pro 


Asp 


Lys 


Ser Lys Asn Lys 


Lys 


Lys Gin 


Arg Lys Gly 


548 








563 




578 








593 


CCT CAG 


CAG 


AAG AGT 


CAG 


ACG 


CTC 


CAG TTT GAT GAA 


CAG 


ACC CTG 


AAG 


AAG GCA 


Pro Gin 


Gin 


Lys Ser 


Gin 


Thr 


Leu 


Gin Pie Asp Glu 


Gin 


Thr Leu 


Lys 


Lys Ala 






608 








623 




638 






AGA AGA 


AAG 


CAA TGG 


ATT 


GAA 


CCC 


QGG AAT TCT GCC 


AGA 


OGG TAC 


CIT 


AAA GIG 


Arg Arg 


Lys 


Gin Trp 


lie 


C-lu 


Fro 


Arg Asn Cys Ala 


Arg 


Arg Tyr 


Leu 


Lys , Val 


653 






668 






683 






698 




GAC TTC 


GCA 


GAT ATT 


GGC 


TGG 


AGO 


GAA TGG ATI ATT- 


TCC 


CCC AAG 


TCC 


TTC GAT 


Asp Fhe 


Ala 


Asp He 


Gly 


Trp 


Ser 


Glu Trp- He He 


Ser 


Pro Lys 


Ser 


Phe Asp • 




713 








723 




743 




756 766 


GCC TAT 


TAC 


TCC TCC 


GGA 


GCG 


TCC 


CAG TTC CCC ATC 


CCA 


AAG GTAGCCATTC TTTTITCTCT 


Ala Tyr 


Tyr 


Cys Ser 


Gly 


Ala 


cys 


Gin Fhe Pro MET 


Pro 


Lys 







776 786 

TCTCCTTCOC A37ITCCATAG ; ard 



284 294 304 319 

CTPlACCIGIG TTCTCCCTiT TOGITCCIAG TCT TIG AAG CCA TCA AAT CAG GCT ACC 

Ser Leu Lvs Pro Ser Asn His Ala Thr 

334 349 364 379 

ATC CAG AGT ATA GIG AGA GCT GIG GGG GTC GIC CCT GGA ATC CCT GAG CCT TCC 

lie Gin Ser lie Val Arg Ala Val Glv Val Val Pro Gly lie Pro Glu Pro Cys 

394 409 424 439 

TGT GIG CCA GAA AAG ATC TCC TCA CTC AGO ATC- TLA TTC TTT GAT GAA AAC AAG 

Cys Val Pro Glu Lvs MET Ser Ser Leu Ser lie Leu Phe. Phe Asp Glu Asn Lys 



454 469 484 

AAT GIG GTA CIT AAA GTA TAT CCA AAC ATC ACA GTA GAG TCT TCT GCT' TCC AGA 
Asn Val Val Leu Lys Val Tyr Pro Asn MET Thr Val Glu Ser Cys Ala Cys Arg 

503 513 523 533 

TAACCIGGIG AAGAACTCAT CIGGATGCIT AACTCAATCG. 
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2. A gene encoding bovine BMP-3 having the amino acid sequence given in claim 1 . 

3. A gene encoding a protein exhibiting properties of BMP-3 and comprising a DNA sequence: 

(a) which differs from a DNA sequence of claim 1 in codon sequence due to the degeneracy of the 

5 genetic code; 

(b) which hybridises with a DNA sequence of claim 1 or section (a), above; or 

(c) represents a fragment, allelic or other variation of a DNA sequence of claim 1, whether said 

variation results in changes in the peptide sequence or not. . 

io 4. The DNA sequence of claim 3, which encodes human BMP-3. 

5. The DNA sequence of claim 3 or 4, which is a genomic DNA sequence. 

6. The DNA sequence of claim 3 or 4, which is a cDNA sequence. 

is 

7. A vector containing the gene or DNA sequence of any one of claims 1 to 6 in operative association with 
an expression control sequence; 

8. A cell transformed with a vector of claim 7. 

20 

9. The cell of claim 8 which is a mammalian cell, a bacterial cell, an insect cell or a yeast cell. 

10. The cell of claim 9 which is a CHO cell. 

25 11. A protein exhibiting properties of BMP-3 which is encoded by the gene or DNA sequence of any one of 

claims 1 to 6. 

12. A protein exhibiting properties of BMP-3 which is produced by the steps of culturing in a suitable 
culture medium a cell transformed with an expression vector comprising a gene or a DNA sequence of 

so any one of claims 1 to 6, and recovering said protein from said culture medium. 

13. A process for producing the protein of claim 11 or 12, comprising the steps of culturing in a suitable 

culture medium the cell of claim 9 and isolating said protein from said culture medium. 

35 14. A pharmaceutical composition comprising the protein of claim 11 or 12 and a pharmaceutically 

acceptable vehicle. 

15. The pharmaceutical composition of Claim 14, further comprising a matrix capable of delivering the 
composition to the site of the bone or cartilage defect and providing a structure for inducing bone or 

40 cartilage formation. 

16. The pharmaceutical composition of claim 15, wherein said matrix comprises hydroxyapatite, collagen, 
polylactic acid or tricalcium phosphate. 



45 17. Use of a protein of claim 11 or 12 for the preparation of a pharmaceutical composition for inducing 

bone or cartilage formation. 
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